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Abstract 



This is the report- version of a mini-series of two articles ||60| , plf on the foundations 
of conjunctive normal forms with non-boolean variables. These two parts are here 
bundled in one report, each part yielding a chapter. 

Part I We consider the problem of generalising boolean formulas in conjunctive 
normal form by allowing non-boolean variables, with the goal of maintaining com- 
binatorial properties. Requiring that a literal involves only a single variable, the 
most general form of literals are the well-known "signed literals" , corresponding to 
unary constraints in CSP. However we argue that only the restricted form of "nega- 
tive monosigned literals" and the resulting generalised clause-sets, corresponding to 
"sets of no-goods" in the AI literature, maintain the essential properties of boolean 
conjunctive normal forms. In this first part of a mini-series of two articles, we 
build up a solid foundation for (generalised) clause-sets, including the notion of 
autarky systems, the interplay between autarkies and resolution, and basic notions 
of (DP-)reductions. As a basic combinatorial parameter of generalised clause-sets 
we introduce the (generalised) notion of deficiency, which in the boolean case is 
the difference between the number of clauses and the number of variables. Autarky 
theory plays a fundamental role here, and we concentrate especially on matching 
autarkies (based on matching theory). A natural task is to determine the structure 
of (matching) lean clause-sets, which do not admit non-trivial (matching) autarkies. 
A central result is the computation of the lean kernel (the largest lean subset) of a 
(generalised) clause-set in polynomial time for bounded maximal deficiency. 

Part II Concluding this mini-series of 2 articles on the foundations of gener- 
alised clause-sets, we study the combinatorial properties of non-boolean conjunctive 
normal forms (clause-sets), allowing arbitrary (but finite) sets of values for variables, 
while literals express that some variable shall not get some (given) value. First we 
study the properties of the direct translation (or "encoding" ) of generalised clause- 
sets into boolean clause-sets. Many combinatorial properties are preserved, and 
as a result we can lift fixed-parameter tractability of satisfiability in the maximal 
deficiency from the boolean case to the general case. Then we turn to irredundant 
clause-sets, which generalise minimally unsatisfiable clause-sets, and we prove basic 
properties. The simplest irredundant clause-sets are hitting clause-sets, and we pro- 
vide characterisations and generalisations. Unsatisfiable irredundant clause-sets are 
the minimally unsatisfiable clause-sets, and we provide basic tools. These tools al- 
low us to characterise the minimally unsatisfiable clause-sets of minimal deficiency. 
Finally we provide a new translation of generalised boolean clause-sets into boolean 
clause-sets, the nested translation, which preserves the conflict structure. As an 
application, we can generalise results for boolean clause-sets regarding the hermi- 
tian rank/defect, especially the characterisation of unsatisfiable hitting clause-sets 
where between every two clauses we have exactly one conflict. We conclude with a 
list of open problems, and a discussion of the "generic translation scheme" . 

Keywords: generalised clause-sets, signed formulas, non-boolean variables, sat- 
isfiability problem, constraint satisfaction problem, autarkies, deficiency, polyno- 
mial time, matching autarkies, lean clause-sets, boolean translations, direct encod- 
ing, irredundant clause-sets, minimally unsatisfiable clause-sets, hitting clause-sets, 
disjoint DNF, hermitian defect, nested translation 



Contents 



1 Autarkies and deficiency ^ 

1.1 Introduction || 

1.1.1 Generalising the notion of deficiency ^ 

1.1.2 Examples: Colourings and homomorphisms |] 

1.1.3 Signed formulas and resolution || 

1.1.4 Overview and main results ^ 

1.2 Preliminaries ^ 

1.2.1 Variables and partial assignments |o| 

1.2.2 Graphs and matchings ^ 

1.3 Generalised (multi-) clause-sets ^ 

1.3.1 Syntax: The notion of "multi-clause-sets" ^ 

1.3.2 Semantics: The operation of partial assignments ^ 

1.3.3 Renaming variables ^ 

1.3.4 Conflict structure |7| 

1.3.5 Three operations of sets of variables on multi-clause-sets ... ^ 

1.4 Autarkies for generalised multi-clause-sets ^ 

1.4.1 The notion of an autarky ^ 

1.4.2 Autarky systems ^ 

1.5 Resolution ^ 

1.5.1 The resolution rule ^ 

1.5.2 The duality between resolution and autarkies |4| 

1.5.3 DP-resolution ^ 

1.6 Reductions ^ 

1.6.1 Basic reductions ^ 

1.6.2 DP-reductions |7| 

1.7 Matching satisfiability and deficiency ^ 

1.7.1 Matching satisfiable generalised clause-sets |o| 

1.7.2 The deficiency of generalised clause-sets ^ 

1.8 Satisfying assignments versus matching-satisfying assignments .... ^ 

1.8.1 The notion of matching-satisfying assignments ^ 

1.8.2 Matchings within satisfying assignments |4| 



1 



1.9 Matching autarkies ^ 

1.9.1 Matching autarkies for generahsed clause-sets ^ 

1.9.2 Matching lean generalised clause-sets ^ 

1.9.3 Matching lean clause-sets of minimal deficiency ^ 

1.9.4 Comparison with an earlier version of "matching autarkies" . ^ 

1.10 Finding autarkies in poly-time for bounded maximal deficiency . . . ^ 

1.11 Expansion and the surplus ^ 

1.11.1 Basic properties ^ 

1.11.2 Decreasing the maximal deficiency ^ 

1.11.3 Reductions H 

1.12 Conclusion and open problems ^ 

1.12.1 Matching autarkies |9| 

1.12.2 Satisfiable minimally matching-unsatisfiable clause-sets ... ^ 



2 Minimal unsatisfiability and conflict structure 52 

2.1 Introduction ^ 

2.1.1 Translations ^ 

2.1.2 Irredundant and minimally unsatisfiable clause-sets ^ 

2.1.3 Overview and main results ^ 

2.2 Preliminaries |4| 

2.3 The direct translation ^ 

2.3.1 The details of the translation ^ 

2.3.2 Preservation of general structure |7| 

2.3.3 Preservation of matching structure ^ 

2.3.4 Polynomial time SAT decision in the maximal deficiency . . . ^ 

2.4 Irredundant generalised clause-sets ^ 

2.4.1 Irredundant clause-sets ^ 

2.4.2 Hitting and multihitting clause-sets ^ 

2.4.3 Irredundant cores ^ 

2.5 Minimally unsatisfiable generalised clause-sets ^ 

2.5.1 Saturated minimally unsatisfiable clause-sets ^ 

2.5.2 Characterisation of the basic case of deficiency one ^ 

2.5.3 Stability parameter and minimal variable degree ^ 

2.6 The conflict structure of generalised clause-sets ^ 

2.6.1 A new boolean translation ^ 

2.6.2 The hermitian defect, and regular hitting clause-sets ^ 

2.7 Conclusion and open problems |o| 

2.7.1 Minimally unsatisfiable clause-sets of low deficiency ^ 

2.7.2 Conflict combinatorics ^ 

2.7.3 Translations to boolean clause-sets p2^ 



2 



Chapter 1 

Autarkies and deficiency 



1.1 Introduction 

Satisfiability problems with variables having more than two values occur naturally 
in many places, for example in colouring problems. Translations into boolean sat- 
isfiability problems are interesting and useful (see [^3[ ^ ^ for various 
techniques) , and may even improve performance of SAT solving, however they hide 
to a certain degree the structure of the original problem, which causes these trans- 
lations typically to be not very well suited for theoretical studies on the structure 
of the original problem. In this series of two article^ we study non-boolean satis- 
fiability problems closest to boolean conjunctive normal form, namely satisfiability 
of what is called generalised clause-sets (or sets of "no-goods"). Combining suit- 
able generalisations of boolean techniques with suitable translations into boolean 
clause-sets (preserving certain structures) we obtain non-trivial generalisations of 
fundamental theorems on autarkies and minimally unsatisfiable formulas. 

Three aspects of clauses (as combinations of literals) make processing of boolean 
clause-sets especially efficient: 

(i) When the underlying variable of a literal gets a value, then the literal becomes 
either true or false (this enables efficient handling of literals). 

(ii) Only by assigning a value to all the variables in a clause can we falsify the 
clause, and for each variable the value here is uniquely determined (this makes 
a tight connection between partial assignments and clauses, and enables "con- 
flict learning" by clauses). 

(iii) By giving just one variable a right value we are always able to satisfy a clause 
(this enables simple satisfaction-based heuristics). 

Taking these properties as axiomatic, a "generalised clause" should be a disjunction 
of generalised literals, and a "generalised literal" should have exactly one possibility 
to become false, while otherwise it should evaluate to true. We arrive naturally at 
the following concept for generalised literals (the earliest systematic use seems to be 
in [|lO|): A variable v has a domain Dy of values, and a literal is a pair {v, e) of the 
variable and a value e € Dy such that the literal becomes true under an assignment 
(f iS (p sets u to a value different than e (i.e., ip{v) e Dy \ {e}); to express this 
interpretation, often when displaying formulas we will write "w ^ e" for the literal 

^) based on the report pT] 
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(w, e). In case of Dy = {0, 1}, variable v becomes an ordinary boolean variable with 
the literal (w,0) representing the positive literal. We remark here that a fourth 
property of boolean clauses, namely that if all literals except of one are falsified, 
that then the value for the variable in the remaining literal is uniquely determined, 
which is the basis for the ubiquitous unit-clause propagation, is necessarily lost here. 

In this first part we investigate the basic combinatorial properties of gener- 
alised clause-sets and the basics of the theory of autarkies, while in the second part 
structural properties of minimally unsatisfiable generalised clause-sets (and, more 
generally, irredundant generalised clause-sets) will be investigated. In the following 
Subsections |l . 1 . 1| - L.1.3 we give an introduction into basic motivations and ideas, 
while in Subsection 1.1.4 we give an overview on the structure of this article and its 



main results. We remark that the most comprehensive and up-to-date overview on 
minimal unsatisfiability and autarkies in the boolean setting one finds in Chapter 
11 in the Handbook of Satisfiability (see [^). 



1.1.1 Generalising the notion of deficiency 

Using c{F) for the number of clauses in a clause-set, and n{F) for the number of 
variables, in the deficiency S{F) :— c{F) —n{F) has been introduced and made 
fruitful, for the study of minimally unsatisfiable boolean clause-sets as well as for 
the introduction of a new polynomial time decidable class of "matched" satisfiable 
(boolean) clause-sets: 

• Let AiU denote the class of minimally unsatisfiable clause-sets (unsatisfiable 
clause-sets, where each strict sub-clause-set is satisfiable). For boolean F G 
MU the property VF' C F : S{F') < S{F) has been shown; using S*{F) := 
maxpi(ZF S{F') for the maximal deficiency we get S*{F) = 5{F) as well as 
"Tarsi's lemma" 6{F) > 1 (since for the empty clause-set T C F we have 
<5(T)=0). 

• The class A4SAT of "matching satisfiable" clause-sets F is defined by the 
condition S*{F) = 0. All matching satisfiable clause-sets are in fact satisfiable, 
since by Hall's theorem the bipartite graph B{F) contains a matching covering 
all variables, where the vertices of B{F) are the clauses of F on the one side 
and the variables of F on the other side, while an edge joins a variable and 
a clause if that variable appears in the clause (positively or negatively). Or, 
using Tarsi's lemma, one argues that if F e M.SAT would be unsatisfiable, 
then F would contain some minimally unsatisfiable F' C F, for which 5{F') > 
1 would hold, contradicting S*{F) = 0. 

The study of the levels MU{k) of minimally unsatisfiable boolean clause-sets 
F with 5{F) < k has attracted some attention. In (where also Tarsi's lemma 
has been proven) the class SA4U of "strongly minimally unsatisfiable clause-sets" 
has been introduced, which are minimally unsatisfiable clause-sets such that adding 
any literal to any clause renders them satisfiable, and a nice characterisation of 
SA4U{1) — {F e SM.IA : 5{F) — 1} has been given (yielding polynomial time 
decision of SMU{1)). Then in |l^ a characterisation of MU{1) has been obtained, 
followed by a (partial) characterisation of A^Z//(2) in [|4| (achieving poly-time deci- 
sion), while in some subclasses of A^W(3) and A^W(4) have been shown to be 
poly-time decidable. For arbitrary (constant) fc G N it has been shown in |33| that 
for F G M.U{k) there is a tree resolution refutation using at most 2'"'"^ •n(F)^steps, 
and thus the classes A4U{k) are in NP (for fixed fc; by just guessing the refutation). 
In it has been conjectured that in fact all classes M.U{k) are in P (for fixed fc). 
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This conjecture has been proven true in [ p4| , using tools from matroid the- 
ory. Actually the classes SAT{k), consisting of all satisfiable clause-sets F with 
S*{F) < k, have been shown poly-time decidable, from which immediately poly- 
time decision of the classes AAU{k) and SM.lA{k) follows (all for arbitrary but fixed 
k). Regarding the method used, more precisely the classes USAT{k) of unsatisfiable 
clause-sets F with S*{F) < k have been shown poly-time decidable by improving 
the "splitting theorem" from Tree resolution refutations for F using at most 
2''~^ ■ n{F) steps and of a simple recursive structure have been obtained, so that 
these refutations can be found in polynomial time by means of enumeration of the 
circuits of the transversal matroid T{F) associated to the bipartite graph B{F), 
where the independent subsets of T{F) are the matching satisfiable sub-clause-sets 
of F. Independently also in poly-time decision of the classes A4U{k) has been 
derived by extending techniques from bipartite matching theory to directed bipar- 
tite graphs. Improving the proofs from [^^, the present author joined the team in 
p9| . Actually refining the techniques from , in fixed-parameter tractability 
of SAT{k) is shown (all this for the boolean case). 

After setting syntax and semantics for generalised clause-sets, the first main 
task tackled in the present paper is to transfer these results regarding the defi- 
ciency to generalised clause-sets. After suitably generalising the notion of deficiency 
and m atching satisfiability (which is not completely straight-forward; in Subsection 



1.9.4 an earlier version is discussed, which doesn't seem to have the right proper- 



ties) , in Corollary |1.8.7| the "satisfiability-based" approach from ]19[ yields polyno- 
mial time satisfiability decision for generalised clause-sets with bounded maximal 
deficiency. Generalising fixed-parameter tractability turns out not to be straight- 
forward (again), and only by combining the generalised approach with a suitable 
translation into the boolean case we arrive in Theorem 2.3.5| (Part II) at fixed pa- 



rameter tractability also for generalised clause-sets. The general framework for our 
considerations is autarky theory as started in ||45|| , with emphasise on matching 
autarkies as introduced in ^7\ . We remark that autarky theory provides a natu- 
ral connection to hypergraph colouring, as investigated in connection with "Polya's 
Problem" in ||53| (for the case of 2-colourings); see the subsequent subsection for 
the general notions. 

A key point for structural investigations in (generalised) clause-sets is to under- 
stand the effects of applying partial assignments; see for example |3^ , where 
splitting of minimally unsatisfiable boolean clause-sets is studied in some depth. In 
Part II we will consider the basic questions regarding irredundant and minimally 
unsatisfiable generalised clause-sets, which leads in a natural way to the study of hit- 
ting clause-sets and generalisations. The well-known classifications of the simplest 
case of minimally unsatisfiable clause-sets, namely boolean clause-sets of deficiency 
1, finds a natural generalisation in Theorem 2.5.5| (Part II, where again the proof is 



not straight- forward, caused by the breakdown of the "saturation method"). While 



minimally unsatisfiable clause-sets have a deficiency at least 1, in Corollary 2.6.5 
(Part II) we show (generalising ^^) that regular hitting clause-sets, generalised 
clause-sets which have a constant number of clashes between any two different 
clauses, have deficiency at most 1. The intersection of both classes, the unsatis- 



fiable regular hitting clause-sets, is characterised in Corollary 2.6.7 (Part II), and 
found for the special case of boolean clause-sets a recent application in , while 
we have generalised these results in . 

1.1.2 Examples: Colourings and homomorphisms 

Generalised clause-sets allow a natural representation of many combinatorial prob- 
lems, and in this section we discuss basic examples. 
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Given a hypergraph G and a set C of "colours" , a C-colouring of G is a map 
/ : V{G) — >■ C such that no hyperedge H G E{G) is "monochromatic" (that is, 
there must be vertices v,w d H with f{v) ^ /(w)). Translating this colouring 
problem into a generalised satisfiability problem Fc{G) is straightforward, using 
the vertices of G as variables with (uniform) domain C:^ 

• For hyperedge H G E{G) and colour e e C form the clause {w 7^ e : u e H}. 

• Fc{G) is the set of aU these clauses; thus n{Fc{G)) = \V{G)\ and c{Fc{G)) = 
\C\-\E{G)\. 

Obviously the C-colourings of G correspond 1-1 to the (total) satisfying assignments 
for Fc{G). Interesting examples of hypergraph colouring problems are given by the 
diagonal van der Waerden problems and the diagonal Ramsey problems. Computing 
van der Waerden numbers has been considered in |l^, |^ ^ ||, |5|, ^, ^ , and 
SAT solvers are performing quite well (likely the best method available), actually 
helping to compute new van der Waerden numbers^, so here is the problem: 

Consider natural numbers /c, n e N, and let the hypergraph ap(A;, n) have ver- 
tex set {1, . . . , n}, while the hyperedges of ap(fc, n) are the subsets H C {1, . . . , n} 
of size k which form an arithmetic progression, that is, for every H there exist 
a,d € {1, . . . , n} with H = {a + i-d : i G {0,...,fc — 1}}. Now for to e N the van der 
Waerden number vdwm(fc) is the minimal n such that ap(fc, n) is not TO-colourable. 
The corresponding generalised clause-sets are Fw('ti, fc,n) := ^{i,...,m}(ap(A:, n)), 
and if F^^{m, fc, n) is satisfiable, then vdwm(fc) > n, while if Fy/ijn, k, n) is unsat- 
isfiable, then vdwm(fc) < n; for to = 2 we obtain boolean clause-sets (I would like 
to point out how natural the translation is — no auxiliary variables are involved^. 
The only known precise van der Waerden numbers (besides the trivial values for 
TO = 1 or A; < 2) are vdw2(3) = 9, vdw2(4) = 35, vdw2(5) = 178, vdw3(3) = 27 
and vdw4(3) = 76, and all these numbers can be easily calculated using most cur- 
rent SAT solvers; furthermore recently in [^o| by (extensive) SAT-computations 
vdw2(6) — 1132 has been confirmed (as conjectured by [^). Directly express- 
ing the problem instance as a generalised clause-set (and skipping the hypergraph 
colouring problem), in this way also the non-diagonal versions of van der Waerden 
and Ramsey problems (making different requirements on the different colours) can 
be immediately translated into generalised clause-sets; see for some direct com- 
putations, and 1^, 1^, Q for more advanced computations using SAT solving (both 
regarding van der Waerden problems). In p8| we give an overview on the known 
van der Waerden numbers, and "Green- Tao numbers" are introduced, based on the 
celebrated strengthening of van der Waerden's theorem by the Green- Tao theorem 
(]27[|). For further applications of the mapping G Fc{G) from hypergraphs to 
clause-sets see 153, Ml . 



^)This translation generalises the well-known translation of graph 2-colouring problems into 
boolean CNF. Appending the tran slati on from generalised clause-sets into boolean clause-sets via 



the direct translation (see Section 2.3 in Part II), this combined translation also generalises the 
well-known direct translation of (arbitrary) graph colouring problems into boolean CNF. 

^) Ramsey problems seem harder for SAT solvers in the sense that these problems carry some 
structure unknown to the SAT solver (but known to the researcher), so that the problems related to 
unknown Ramsey numbers are rather big; see jTSj for the known numbers. Nevertheless improving 
the currently known bounds 43 < ram2(5) < 49 might be in reach for specialised SAT solvers 
(where ram2(5) is the minimal number of vertices in a complete graph such that every labelling 
of the edges with two "colours" must have a "monochromatic" 5-clique). Though one needs to 
say here that there is strong evidence for ram2(5) = 43 (see |73[), and so an apparently very hard 
unsatisfiable problem is to be tackled. 

The translation is the core of the two translations discussed in jl^ — the additional constraints 
used in [ |l7| | just express the structural property of a generalised clause-set that every variable gets 
exactly one value of its domain. 
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For the more general list-hypergraph colouring problem, for each vertex v a hst 
L(v) of allowed colours is given; this can be translated into a generalised clause-set 
Fc{G,L) by just restricting the domain of v to L{v). At this point it is worth 
noticing that also the still more general list-hypergraph-homomorphism problem has 
a direct (structure-preserving) translation into a satisfiability problem for gener- 
alised clause-sets. Given two hypergraphs Gi,G2 and for each vertex v £ V{Gi) 
a non-empty set L{v) C V{G2) of allowed image vertices, the problem is to find a 
map / : V{Gi) V{G2) with f{v) G L{v) for all v G V{Gi) such that for each 
hyperedge H £ E{Gi) we have f{H) e E{G2)- Note that if we take for G2 the 
hypergraph Gc with vertex set G and hyperedges all subsets of G with at least two 
elements, then the homomorphisms from Gi to G2 are exactly the C-colourings for 
Gi. For the translation of the list-hypergraph-homomorphism problem we use the 
set V{Gi) of vertices as the set of variables, while the domain of v is — L{v), 
and for each hyperedge H G E{Gi) and for each map f : H V{G2) such that 
for each v £ H we have f{v) € L{y) and such that f{H) ^ E{G2) holds, we 
have a clause Ghj '■— {v ^ f{v) : v G H}. Now satisfying assignments of the 
generalised clause-set F{Gi,G2, L) consisting of all clauses Ghj are exactly the 
hypergraph homomorphisms from Gi to G2 respecting the restrictions given by L. 
Note that the translation of hypergraph colouring problems is a special case via 
FciG,L)^F{G,Gc,L). 

The colourings considered above are also called "weak hypergraph colourings" , 
to distinguish them from strong hypergraph colourings of hypergraph G by a set G 
of colours, which is a map / : V{G) — t- G such that for all hyperedges H e E{G) 
and all v,w £ H , v ^ w, we have f{v) ^ f{w). Collecting all such binary clauses 
{f{v) ^ £, j{w) ^ e} for e e G we obtain a generalised clause-sets whose satisfying 
(total) assignments correspond 1-1 to the strong colourings of G. To conclude 
our list of translations for colouring problems we mention that mixed hypergraph 
colouring as studied in |Q takes a pair Gi, G2 of hypergraphs with V{G\) = V{G2), 
and the "mixed colourings" using colour-set G are (weak) colourings of Gi which 
are not strong colourings of G2 ; the most natural translation of this problem seems 
to consist of a pair of a generalised CNF (the translation of the (weak) colouring 
problem) and a generalised DNF (the negation of the translation of the strong 
colouring problem), using "monosigned literals" (see the next subsection), that is, 
allowing for inequalities "u ^ e" as well as equalities "v = e" . 

In the same vein as for hypergraph homomorphisms we can also translate ho- 
momorphism problems for relational structures: Let A = {A, (Ri)i^i) and B = 
{B, {R'j)i(zj) be two compatible finite relational structures, that is, A,B as well as 
/ are finite sets, the Ri are relations (of arbitrary arity) on A and the R'^ are re- 
lations on B, while Ri has the same arity as R'^. We want to express the set of 
homomorphisms f : A ^ B, defined by the property that for i € I and all x € Ri 
we have f{x) £ R'i, where / is applied componentwise to x. For this we choose 
A as the set of variables, which all have the same domain B, and for each i £ I, 
each X G Ri and each y G \ R'i, where m is the arity of Ri, we have the clause 
Gi,x,y '■— {^i Vi '■ i G {1, • ■ • ,™}}- We obtain the generalised clause-set F(^, B) 
by collecting all these clauses. The size of F{A, B) is polynomial in the sizes of 
A, B together with the number of tuples in Ri and the number of tuples not in 
R'i- The requirement that the homomorphism / is injective can be encoded by the 
binary clauses {a ^ b,a' ^ b} for a, a' E A with a ^ a' and for b G We 

^)On the other hand, to formulate surjectivity of / requires an exponential number of clauses; 
one sees that for the good combinatorial properties, which can be exploited for problems expressed 
in the language of (generalised) clause-sets (which is quite restricted from the constraint program- 
ming point of view), we have to pay the price that some natural problems do not have succinct 
representations (without using additional variables). 
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note that the translations F{Gi,G2, L) as well as F{A,B) are "direct" (homomor- 
phisnis are directly encoded as assignments) and "negative" (we use "forbidden" 
value combinations). 

If we wish to have F(A^ B) polynomial in the number of tuples in i?^, then we 
can use an "indirect" and "positive" translation as follows: Variables are pairs {i,x) 
for i € I and x ^ Rt, where the domain of variable (i, x) is i?^; so instead of mapping 
elements of A to elements of B, where constraints forbid that allowed tuples are 
mapped to disallowed tuples, here now we directly map tuples from relations in A 
to tuples in the corresponding relation in B, and the constraints will ensure that this 
mapping actually is induced by some mapping from A to B. The constraints are 
the unit clauses {(«, x) ^ y} for variables (i, x) and values y such that indices k, k' 
exist with Xk — Xk' but yk ^ yk', and the binary clauses {{i,x) ^ y, {i' ,x') ^ y'} 
for variables (z, x), {i', x') and values y, y' , such that an index k exists with 
but with yk ^ fk- 

1.1.3 Signed formulas and resolution 

Are there still more general versions of "generalised conjunctive normal forms" 
suitable in our context? The most general form of variable-based literals allows 
literals of the form "w G 5" for some S C Dy, and then (our) generalised literals 
{v, e) are represented using S = Dy\ {e}. See 0, |^ for entry points into the 

literature, where S is called a "sign" , while literals of the form "v € 5" are called 
"signed literals" , and clause-sets made of signed literals "signed CNF formulas" . 
More precisely literals "v G S"" are "positive" , while a "negative" literal is of the 
form "v ^ S" , and so generalised literals {v, e) correspond best to "v ^ {e}" instead 
of (as above) "w G Dy\ {e}"; obviously when considering all possible signs then 
the positive and the negative formulations are equivalent, but this changes when 
considering only special types of signs, as we will see below. See also [Q, which 
uses the same class of formulas, but calling them "nb-formulas" . Our generalised 
clause-sets, based on "w ^ e" , are "negative monosigned CNF formula" (the signs 
are singleton sets), while "monosigned CNF formula" allow also positive signed 
literals (with singleton signs; alternatively, negative signs with just one elements 
missing can be used) . We remark that in general negative literals naturally belong 
to CNF, while positive literals naturally belong to DNF. 

So the closest (further) generalisation of our "clause-sets" are "monosigned CNF 
formulas" , which allow besides "w ^ e also "w — e" . Considering this extension is 
also motived by the fact that these formulas correspond exactly to their boolean 
counterpart via the natural translation (which uses a boolean variable to express 
that a variable is equal (or not) to a specific value). However, monosigned for- 
mulas seem to lack the good combinatorial properties which "negative monosigned 
formulas" have. This can be seen for example by the fact that the boolean transla- 
tion of monosigned CNF formulas need the "AMO" clauses (expressing that every 
(original) variable gets at most one value) , making the translated formula unwieldy 
(from a combinatorial point of view), while the AMO clauses are not needed for 
negative monosigned CNF — here, without destroying the satisfaction relation we 
can just select some value in case an original variable gets several values, which is 
not possible in the presence of literals demanding that a variable gets some fixed 
value, since here those several values make more clauses true than any single of 
them. The point is that negative monosigned formulas need ALO, while AMO is 
optional, and positive monosigned formulas need AMO, while ALO is optional, and 
finally monosigned formulas need ALO as well as AMO. 

An important point has been raised in , where it has been shown that split- 
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ting on the boolean translation of generalised clause-sets can have an exponential 
speed-up over the (wide) splitting only available when splitting on the original ("neg- 
ative") literals, where one considers \Dy \ many branches for splitting on a variable 
V, each branch fixing a value of This seems to be an inherent weakness of using 
generalised clause-sets for SAT solving — they are not expressive enough to allow 
certain crucial inferences to be formulated succinctly. But actually our model of 
generalised clause-sets can allow the form of binary splitting corresponding to splits 
on the boolean translation as follows: Our literals can express only "w 7^ e", but 
since we allow arbitrary variable domains, we can have a binary splitting with a 
domain collapse H> {e} in one branch (splitting on the positive literal "w = e" ) 
and a domain restriction £)„ 1-^ D^, \ {s} in the other branch (i.e., splitting on the 
negative literal "v 7^ e"): In the first branch all literals with variable v would be- 
come true or false, while in the second branch possibly the literal stays, and only 
the domain of v is restricted (globally)J^ More generally, if {Di)i^j is a partition 
of Dy then we can split into |/| branches where in branch i variable v gets the new 
domain Di; if for a literal {v, e) we have e ^ Di, then the literal (and thus the clause) 
becomes true, while if Di = {e}, then the literal becomes false, and otherwise just 
the domain of v is restricted (globally) . The splitting trees for (generalised) clause- 
sets with domain-splittings "{{e},Dy \ {e})" correspond exactly to the splitting 
trees for the natural boolean translations. The price we have to pay however for 
this more powerful branching is, that if we stick with (generalised) clause-sets, then 
we can not have (full) clause learning — if we want to use clause learning, in this 
way reflecting the search process within the "clause-database", then at least for 
recording the learned clauses we need monosigned clauses to record these binary 
splittings (and signed literals for more general domain splittings). ^ This distinction 
between the "logic" representing the input and the "logic" representing branching 
will be further developed in the OKlibrary^, a generative library for generalised 
SAT solving ([|6j gives an introduction into the general ideas of this "research plat- 
form"): Problem instances can be represented by arbitrary structures (based on the 
axiomatic approach in ) , while the branching logic (inference and learning) uses 
various types of "literals" appropriate for the algorithms used. 



1.1.4 Overview and main results 



In Section 1.2 we present some preliminaries for our study of generalised clause- 
sets: Partial assignments for non-boolea n va riables, and fundamental notions and 
notations for graphs. Then in Section 1.3 generalised clause-sets and the main 
associated operations are introduce d. A utarkies and autarky systems for ge neralised 
clause-sets are reviewed in Section |l.4| (a useful result here is Lemma 1.4.2 , showing 
how to actually find a non-trivial autarky when just given an oracle deciding whether 
a non-trivial autarky exists or not). Resolution for generalised clause-sets is the 



The corresponding form of resolution has been studied in some depth in (generalised there 
through the use of oracles), and is considered in this paper in Section 1.5|. 



'^^That is, since in the second branch we do not assign a value to variable v, we do not get rid 
off V in the second branch. As a consequence, we need a global domain management. 

*^This discussion shows in my opinion a major reason, why generalising boolean reasoning 
proved to be difficult in the past, and (boolean) SAT solvers have an edge over constraint solvers: 
Either we restrict ourselves to wide branching, which has inherent inefficiencies, or we use more 
powerful branching, and then we have to use a more complicated domain management than in the 
boolean case (where none is needed), and finding out whether a literal actually became true or 
false becomes considerably more complicated (while it is trivial in the boolean case). Furthermore, 
if we want to use learning, which seems of importance for many "real-world" problems, then we 
have to use more complicated literal structures, and domain and literal (occurrence) management 
gets fur ther complicated. 



^'Sce ittp : //www . ok-sat-library . org 
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subject of Section |l.5| (in Theorem 1.5.1 it is proven that a clause can be used in 



some resolution refutation iff it can not be satisfied by some autarky; computation 
of the lean kernel via "intelligent backtracking solvers" follows). And the most basic 
polynomial time reductions for generalised clause-sets are presented in Section 1.6. 



After these preliminaries and foundations, we turn to the main subject of this ar- 
ticle, the study of matching autarkies and their uses. First in Section 1.7 the notion 
of matching satisfiable clause-sets (introduced in |^ under the name of "matched 
clause-sets") is generalised in a natural way to generalised clause-sets, based on the 
generalised notion of deficiency. Matching satisfiable clause-sets are satisfiable by 
special satisfying assignments, called "matching-satisfying assignments" , and these 
assignments are studied in Section |l.8| . Theorem 1.8.4 is the first main result, 
guaranteeing the existence of satisfying assignments "close enough" to matching- 
satisfying assignments. In Corollary 1.8.7 we derive poly-time satisfiability deci- 



sion for generalised clause-sets with bounded maximal deficiency, generalising and 
strengthening the approach from [^9j . Proving fixed parameter tractability with re- 
spect to the maximal deficiency for generalised clause-sets will be establ ished later, 
in Theorem 2.3.5| (Part II), a nd that res ult will not use Theorem 1.8.4 . The main 
application of Theorem 1.8.4| is Theorem 1.10.3 , where the harder problem of find- 
ing a non-trivial autarky (not just a satisfying assignment) is shown to be solvable 
in polynomial time in the maximal deficiency. 

Sectio n |l.g| now studies matching autarkies for generalised clause-sets. In 
Subsection 1.9.1 matching autarkies for generalised clause-sets are introduced, and 
the main properties a re pro ven. A typical result here is the generalisation of "Tarsi's 
Lemma" in Corollary 1.9.9 (every generalised minimally unsatisfiable clause-set has 
defici ency at least one), exploiting the notion of matching leanness. In Section 
1.10 , as already mentioned, we strengthen polynomial time satisfiability decision 
w.r.t. bounded maximal deficiency S*{F) to t he ab ility of computing the lean kernel 
(in polynomial time). Finally in Section [l.ll| the notions of "expansion" and 
"surplus" are transferred from matching theory, yielding a simplified proof of FPT 
for SAT decision w.r.t. the parameter 5*{F) in the boolean case (however we do 
not get a proof for the general case). Many interesting open problems arose in the 
journey so far, and we discuss some of them in Section |1.12| . 



1.2 Preliminaries 



We use N = Z>i and No = Z>o. For a set X by Sx the group of all bijections from 
X to X is denoted, while for n G No we set Sn ■= 'S'{i,. ..,„}. 



1.2.1 Variables and partial assignments 

Fundamental for our considerations is the monoid (73455, o,()) of partial as- 
signments. Here we just recall the basic definitions, while a full account (for the 
boolean case) can be found in . 

The universe of variables is denoted by the infinite set V^, while the universe 
of domain elements is the infinite set X>CA1; a (value-)domain is a finite non- 
empty subset of VOM, and for each variable v G VA we denote the associated 
(value-) domain by D^; thus variables have fixed (value-) domains, and change of 
domain (for example removal of values) must be performed by renaming. For a 
domain D by VA-d the set of all variables with domain D is denoted; to avoid 
running out of variables and to ease renaming, we make the assumption that for all 
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domains D the set VA-d has the same cardinahty as VA itself. A variable v S VA is 
called boolean if Dy = {0, 1} (and thus V^{o,i} is the set of all boolean variables). 

A partial assignment is a map with finite domain var(y;) := dom((p) C VA, 
such that for all v G va.r{(f) we have (f{v) e Dy. The domain size of a partial 
assignment (p is denoted by n(ip) := ]var(<y9)| G Nq. A special partial assignment is 
the empty partial assignment (). The set of all partial assignments is denoted by 
VASS, while for some set VA' C VA of variables we denote by VASS(VA') := 
{(f E VASS : var((/3) C VA'} the set of partial assignments with variables from 
VA' (thus TASS{VA^o^iy) is the set of partial assignments for boolean variables). 
We use the notation {vi ei, . . . ,Vm Sm) to denote the partial assignment <^ 
with n{Lp) = m and f{vi) = Ej (for distinct variables Vi, or for literals with distinct 
underlying variables). 

For two partial assignments (p,tp & JASS their composition <^ o ■0 is defined 
as the partial assignment ip o ijj with domain var((/j o tp) = var(</?) U var(i/)) such 
that first ^ is evaluated and then 'p, i.e., {ip o 'ij)){v) = ip{v) if f G vaT^tp) while 
otherwise (</? o 'ip){v) — ip{v). An important basic observation is that {TASS, o, ()) 
is a monoid. An alternative representation of this structure is obtained as follows: 
Make each Dy a ("right zero") semigroup {Dy, ■) by defining ei -£2 '■= £2 for £1,62 G 
Dy. Adjoin an identity element "*" to each Dy, obtaining monoids £)*. Now 
VASS is isomorphic to the direct sum 'YlveVA ^* °^ ^^'^ monoids (the sub-monoid 
of the direct product rii;ev.>t ^* given by those elements where only finitely many 
components are different from *), where G VASS corresponds to the map p* G 
n^gv^ £>* with ip{v) — p* (v) for v G var((^) and ip>* {v) = * for w G VA\vm:{ip) . This 
representation of partial assignments as total maps with distinguished "undefined" 
value * actually has certain advantages over representation using partial maps, since 
working with total maps is often easier than working with partial maps, and we get 
a somewhat richer algebraic structure; however in this article we stick to the above 
representation of partial assignments. 

1.2.2 Graphs and matchings 

A (finite) graph G here is a pair G = {V,E) with finite vertex set V{G) = V and 
edge set E{G) =E^{^^), where for a set M and G No by (^) we denote the set 
of all subsets TQM with |T| = k. So graphs here have no parallel edges and no 
loops. A graph G' is a subgraph of a graph G if V{G') C V{G) and E{G') C E{G); 
G' is called a partial subgraph of G if G' is a subgraph of G and V(G') = V{G). 
A graph G is called a forest if G contains no cycle. A graph G is complete if all 
distinct vertices v,w € V{G) are adjacent. G is bipartite, if the chromatic number 
of G is at most 2, while G is complete bipartite if G is bipartite and addition of 
any edge to G either destroys the graph property (i.e., creates a loop or a parallel 
edge) or destroys the bipartiteness property. More generally, G is called complete 
k-partite for fc G No if the chromatic number of G is at most k, and addition of any 
edge to G either destroys the graph property or increases the chromatic number. 
G is complete fc-partite iff G is the union of at most k independent sets, such that 
each pair of vertices from different independent sets is adjacent (equivalently, iff the 
complement of G is the disjoint union of at most k cliques). 

A function / : 5 — ^ R, where S is some set system stable under union and 
intersection, is called submodular resp. supermodular if for all A,B G 5 we have 
f{A UB) + f{A nB)< /(A) + f{B) resp. f{A U B) + f{A n B) > /(A) + f{B), 
while / is called modular if / is submodular and supermodular. A prototypical 
example for a modular function is A C X i-> f{A) := \A\, where X is some finite 
set. For a graph G and a vertex set A C V{G) the (closed) neighbourset Tg{A) is 
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defined as the set of vertices adjacent to at least one element of A. The function 
A C V{G) t—^ |r(A)| is a prototypical example for a submodular function, while the 
deficiency S{A) := \A\ — \T{A)\ G Z is a supermodular function (as the difference of 
a modular function and a submodular function). 

A matching M in a graph G is a set M C E{G) of edges such that two distinct 
elements of M are non-adjacent. A matching in G which is of maximal size is called 
a maximum matching, and the size of a maximum matching M of G is denoted by 
i/(G) := \M\. If G is a bipartite graph with bipartition {A,B) (also called "colour 
classes"), then iy{G) = \A\ - S*{A) = \B\ - 5*{B), where for any S C V{G) we 
set 5*{S) :— maxg/gg (5(5")f^ A maximum matching in a bipartite graph can be 
computed in time 0{y/v{G) ■ \E{G)\) < 0{y^\V{G)\ ■ \E{G)\) (see Theorem 16.4 in 

A maximal matching M in a graph G is one which can not be extended (that 
is, there is no matching M' in G with M C M'), while the vertices covered by a 
matching M are the vertices incident to one of the edges in M. An M -augmenting 
path for a matching M in G is a path P of odd length with endpoints not covered by 
M and whose edges are alternately out of M and in M (so necessarily start edge and 
end edge (which might coincide) are out of M) . A new matching M+ is obtained 
by adding the edges from P to M , which were not in M , while removing the other 
edges of P from M; we then have |Af+| = \M\ + 1. A matching M in a graph G has 
an augmenting path if and only if M is not maximum (see for example Theorem 16.1 
in Q). If G is bipartite, then deciding whether M has an augmenting path, and 
finding one if existent, can by done by breadth-first search in the directed bipartite 
graph naturally associated with the notion of augmenting paths (see Section 16.3 
in @), and so this process takes time 0{\E(G)\. Using this process to construct a 
maximum matching, starting with the empty matching, takes time 0{v{G) ■ \E{G)\) 
which is worse than the bound given above, however if a matching M of "reasonable 
size" is already given, then the time 0{{v{G) — \M\) ■ \E{G)\) it takes to construct 
a maximum matching, starting with M, might be better. 

A vertex cover of a graph G is a set T C V{G) of vertices such that every edge of 
G is incident with (at least) one of the vertices in T; a vertex cover of minimal size is 
called a minimum vertex cover, the size of a minimum vertex cover of G is denoted 
by t{G). For bipartite graphs G we have t(G) = I'iG), and given a maximum 
matching of G, in time 0(|_B(G)|) a minimum vertex cover can be computed (see 



Theorem 16.6 in |80|). A witness for v{G) > k can always be given by a matching 
M in G with \M\ > k, while for bipartite graphs G a witness for J^(G) < k can 
always be given by a vertex cover T with |r| < k. For our applications, witnesses 
using the notion of deficiency are more useful (they will yield autarkies in Lemmas 



1.11.1 



and 1.11.8): Given a bipartite graph G with bipartition (A, B) and vertex 
cover T of G, the set A' := ^ \ T has deficiency S{A') > |A| - |T| (while given 
A' C A, the vertex cover T{A') U{A\ A') has size \A\ - S{A')). 



1.3 Generalised (multi-) clause-sets 

In t his sec tion we review the notion of generalised multi-clause-sets. In Subsec- 



tion 



1.3.1| we introduce "generalised multi-clause-sets" and "generalised clause- 



^"'See for example Theorem 22.2 in |80|, where the notion of "transversals" or "systems of distinct 
representatives" of a set system is usea (not to be mixed up with "transversals" in hypergraphs), 
and where the set system is (Tq ({a}))n6A resp. (rG{{^}))beB ■ 

^'^^We won't dwell here on the details of graph representations; however when stating complexity 
results for (multi-)clause-sets we will be more precise. 
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sets", while in Subsection 1.3.2 (partial) assignments and their operation on (multi- 
)clause-sets are discussed. Besides this substitution of truth values into variables 
we only need another special case of general substitution, namely the renaming of 
variables, which is treated in Subsection 1.3.3. Two different clauses by definition 
have a "conflict" if they do not have a common falsifying assignment, and some 
basic notions regarding the "conflict graph" of a (multi-)clause-set are introduced 
in Subsection 1.3.4| . This introduction into "syntax and semantics of generalised 
clause-sets" is completed in Subsection 1.3.5 with the discussion of various opera- 
tions on (multi-)clause-sets F regarding their variable structure (disregarding the 
different "polarities" of variables, i.e., when considering literals, then here only the 
underlying variables play a role). 

For more background information, see ^ for a general, axiomatic framework 
for "generalised satisfiability problems", while in Subsection 2.3 of generalised 
clause-sets are discussed, and in Section 2 of Q boolean multi-clause-sets are 
considered (see also for more information). In this article, when we speak 
of "clause-sets" then we always mean "generalised clause-sets", while clause-sets 
in the "traditional" sense are always qualified as "boolean clause-sets" ; however 
in lemmas, corollaries and theorems we speak of "generalised clause-sets" to ease 
independent access. 



1.3.1 Syntax: The notion of "multi-clause-sets" 

A literal is a pair (w,£) of a variable v £ VA and a value e G Ui,; we write 
var(t), e) := v and val(t>, e) := e. The set of all literals is denoted by CTT, and 
for any VA' C VA we write CITiVA') := {x G LIT : var(a;) G VA'} for the 
set of literals with variables from VA' (thus CIT{VA{o,i}) is the set of boolean 
literals). For a partial assignment if G VASS and a literal {v,e) with v G var((/j) we 
set ip{{v,e)) = 1 if if{v) ^ e, while we set (f{{v,e)) = if ip{v) — e; thus a literal 
{v, e) has the meaning "z; shall not get value e". Accordingly a literal {v, e) is often 
denoted by "w ^ e" . 

A clause C is a finite set of literals not containing "clashing literals" , that is 
for literals x,y € C with x ^ y we have var(a;) 7^ vai{y). The set of all clauses is 
denoted by CC For a clause C we set var(C) :— {var(a;) : x G C}, and for a set 
VA' C VA we write CCiVA') := [C eCC: var(C) C V^'} for the set of clauses 
with variables from VA! (thus CC{yA{Q,i}) is the set of boolean clauses). The empty 
clause is denoted by _L G CC. 

Given a clause C, we obtain the corresponding partial assignment ipc S VASS 
as the partial assignment with var((^) — var(C) and ip(v) = e for (f,e) G C; on 
the other hand, given a partial assignment Lp, we obtain the corresponding clause 
G CC as the clause C with var(C) — var((/3) such that for (^(w) = e we have 
(w,e) G C. Using the representation of maps as ordered pairs of arguments and 
values, actually (fc — C and Cip = ip (and thus CC = VASS)\ explicitely said, a 
clause corresp ond s to the partial assignment which sets exactly the literals in the 
clause to false. ^ 

A (finite) multi-clause-set is a map F : CC — >■ No, assigning to each clause its 
number of occurrences, such that only for finitely many C G CC we have F{C) ^ 0, 
while a (finite) clause-set is a finite subset of CC. Clause-sets F can be implicitly 
converted to multi-clause-sets by setting F{C) :— 1 for C £ F and F{C) := 
otherwise. For a multi-clause-set F the underlying clause-set t{F) is defined as 

I2)'p]jg motivation is, that with a partial assignment ip we restrict the search space, and in case 
the partial assignment is inconsistent with the clause-set _F, then the clause can be "learned" 
(i.e., follows from _F). 
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t(F) = {C e CL : F{C) ^ 0}. This conversion is only performed if necessary to 
apply an operation only defined for clause-sets. We have C E F for a multi-clause- 
set F iff F(C) > 0, and we set var(F) := U{var(C) : C G F}. For a variable veVA 
we define val„(i^) := {e G _D.„ | 3 C G F : (w, e) G C} as the set of occurring values. 
We have var(F) = {u G VA : valt,(F) ^ 0}. Finally the empty clause-set as well as 
the empty multi-clause-set is denoted by T. 

A clause-set F has a uniform domain if \/v,w G var(F) : _D„ — holds. 
Boolean clause-sets have uniform domain, and every finite (generalised) clause- 
set has a "domain-uniformisation" by using the union of all relevant domains and 



adding unit-clauses to forbid unwanted domain elements; see Subsection |1.3.3| for 
details. 

We use the following complexity measures for multi-clause-sets F: 

1. := J2c€F (v e)ec ^ ^0 measures the number of occurrences 
of a literal; 

2. := EeGr'„ = EceF,i,Gvar(c) ^i^) ^ measures the num- 
ber of occurrences of a variable; 

3. S(^,^)(F) := Ee'ei3„\{e} #(v.e){F) = #v{F) - #(.„,e)(^) ^ Nq mcasurcs the 
number of occurrences of literals with variable v and value different from 
e (this is the number of satisfied clauses when assigning value e to u; see 
Subsection 1.3.21 ); 



4. n{F) |var(F)| G No measures the number of variables; 

5. c{F) :— J2ceF ^i^) ^ measures the number of clauses; 

6. eiF) := EcGpFiC) ■ \C\ = E.Gvar(F) #viF) G No measures the number of 
literal occurrences. 

And for multi-clause-sets Fi, F2 we use the following operations and relations: 

1. the mult i- clause- set Fi + F2 is defined by 

(Fi+F2)(C) :-Fi(C)+F2(C) 

for clauses C; 

2. the multi-clause-set Fi U F2 resp. Fi n F2 is given by 

(FiUF2)(C) max(Fi(C),F2(C)) 
(FinF2)(C) min(Fi(C),F2(C)) 

for clauses C; if Fi, F2 are clause-sets, then these operations coincide with the 
ordinary set operations; 

3. if F2 is a clause-set, then the multi-clause-set Fi \ F2 is defined by setting 
(Fi \ F2)(C) := for C G F2, while otherwise (Fi \ F2)(C) := Fi(C); if also 
Fi is a clause-set, then Fi \ F2 is the ordinary set operation; 

4. the relation Fi < F2 holds if for aU clauses C we have Fi(C) < F2(C); we 
use F' < F for F' < F A F' 7^ F; if Fi,F2 are clause-sets, then Fi < F2 ^ 
Fi C F2; 

5. Fi is called a sub-multi-clause-set of F2 if Fi < F2 holds, while Fi is called 
an induced sub-multi-clause-set of F2 if Fi < F2 and VC G Fi : Fi(C) — 
F2(C) holds; every sub-clause-set of a clause-set is induced; 

6. if F2 is a sub-multi-clause-set of Fi, then the multi-clause-set Fi — F2 is 
defined via (Fi - F2)(C) := Fi(C) - F2(C) for clauses C. 
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The sot of all multi-clause-scts is denoted by A/iCjCS, the set of all clause-sets 
by CCS, while for a set VA' C VA of variables we use M.CCS{VA') := {F e 
MCCS : var(i^) C V^'} and CCS{VA') := {F € CCS : var(F) C VA'} (thus 
MCCS{VA[o.i}) is the set of boolean multi-clause-sets, and CCS{VA^o,i}) 
set of boolean clause-sets). If C is a set of multi-clause-sets and / : C — >■ M, then by 
Cf<b for some 6 e M we denote the set of all F e C with f{F) < 5; analogously we 
define Cf=h, Cf>b and so on. A special function usable here is sat : CCS — > {0, 1} 
with sat(F) = 1 F € SAT (that is, sat is the characteristic function of the set 
of satisfiable clause-sets defined below); we can combine several such indices, and 
for typographical reasons we may use then for example A1C£<S?^j . 



1.3.2 Semantics: The operation of peirtial assignments 

Now we define the operation * : JASS x MCCS AdCCS of JASS on multi-clause- 
sets, and the (derived) operation * : "^455 x CCS — > CCS on clause-sets, which in 
both cases have the meaning of substituting values for variables and carrying out 
the resulting simplifications (viewing a clause as a disjunction of its literals, and a 
(multi-)clause-set as a conjunction of its clauses), with the only difference that in 
the case of clause-sete contractions in the result are carried out (distinct clauses can 
become equal after a substitution). The case of clause-sets is reduced to the case of 
multi-clause-sets, using the explicit transformation t : CCS — > A4CCS of clause-sets 
into multi-clause-sets. For F e MCCS and ip e PASS we define tp * F e MCCS 

by 

{^*F){C)= J2 ^(^'), 

c'ecr. 

for C e CC, where for a clause C we set (p*{C} := T £ CCS if there exists a literal 
X G C with (f{x) = 1, while otherwise we set (f * {C} := {C \ C^} G CCS, i.e., we 
remove the falsified literals from C. And for F e CCS we define (p* F G CCS as 

tp* F :=i{(p*i{F)). 

We have hero (whore F is a clause-se<) (p * F = IJceF ''^ * {^}- The effect on 
the basic measures of applying a partial assignment {v ^ e) to F G MCCS with 
V G va,i{F) is given by 

n{{v -^e)*F) < n{F) - 1 
c{{v^e)*F) = c(F)-S(,,e)(i^). 

A clause-set F G CCS is satisfiable if there exists a partial assignment (p E TASS 
with (p*F = T, while otherwise F is unsatisfiable; the set of all satisfiable clause- 
sets is denoted by SA.T, the set of all unsatisfiable clause-sets by US ATT. A multi- 
clausc-sct F G MCCS is called minimally unsatisfiable if F is unsatisfiable, but 
every F' < F \s satisfiable; obviously if F is minimally unsatisfiable, then F actually 
is a clause-set. The set of all minimally unsatisfiable clause-sets is denoted by M.U. 

It is useful to have some notations for the set of satisfying assignments ("mod- 
els") as well as for the set of falsifying assignments. For a finite V C VA let 
VASS{V) be the set of G VASS with var((/?) = V. Note that we have 

\vAss{v)\ = ni^-i- 

Now for a clause-set F G MCCS and for a finite set V of variables with var(F) C V 
let modv(-F) be the set of G VASS{V) with ip*F = T, while falv(-F) is the 
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set of (/9 G VASSiV) with ± e tp^F. Thus F is satisfiable iff mody(F) 7^ 0; and 
for any clause C with var(C) C y we have 

|faV({C})| = |mS5(y \ var(C))| = [] l^-l' 

j;eV\var(C) 

that is, the falsifying assignments for a clause C are uniquely determined on vari- 
ables from C and arbitrary elsewhere. At this point it might be useful to point out 
that for (multi-)clause-sets typically falsifying assignments are significantly easier 
to handle than satisfying assignments ("those elusive idols", since we are dealing 
with CNF; for DNF the situation is reversed). Obviously mody(i^) n faly(i^) = 
and mody(f ) U faV(F) = VASSiV). By definition we have 

faV(F) = y Mvi{C}). 

For clause-sets Fi, F2 wc write Fi |= F2 ("Fi implies F2") if for all ip € VASS with 
Fx =T we have * F2 = T as well, and for clauses C we write F \= C instead 
of F 1= {C}. Trivially F is unsatisfiable iff F |= _L. Note that Fi |= F2 holds ifl[ for 
V := var(Fi) U var(F2) we have faV(F2) C faV(Fi). We call Fi,F2 equivalent if 

Fi ^ F2 and F2 h Fi holds. 

The basic laws for the operation of partial assignments on multi-clause-sets are 
as follows, using F, Fi, F2 G MCCS and ip,tp e VASS: 

()*F = F 
* T = T 

o -i/;) * F = (/9 * (l/i * F) 
(f* {Fi + F2) = (fi * Fi + (f * F2. 

The first three laws hold also for the operation of partial assignments on clause-sets, 
while the composition of multi-clause-sets by addition is to be replaced by union, 
that is for Fi , F2 G CjCS we have 

* (Fi U F2) = * Fi U * F2 

(note that in general this does not hold for multi-clause-sets, where we also have 
defined union). Furthermore for a multi-clause-set F and a clause-set F' we have 
ip*{F\F')>{ip*F)\{ip* F'). 

1.3.3 Renaming variables 

Consider a multi-clause-set F and variables v^w G VA (which might be equal) 
together with h : Dy ^ such that in case of v ^ w we have w ^ var(F). Then 
replacing v by w using ft, in F results in the multi-clause-set F' where every 
occurrence of a literal (v.e) is replaced by the literal (w,h(e)). The map h here 
is called the value transfer; if Dy C Dy, and h is unspecified, then the canonical 
injection is used. 

Similarly, replacing w by w using /i in a partial assignment ip, where in case 
of V ^ w we have w ^ var((p), results in a partial assignment tp' with dom(<y9') = 
(dom((^) \ {v}) U {w} such that <p'{u) = p}{u) for u G dom((/5') \ {w}, while p'{w) = 
h{(p{v)). Here the value transfer needs to be specified only for the special value 
ipiv). If V = w, then we just speak of flipping v to e in (/3 for e = h{p{v)). 

The replacement of by w using /i in F is injective, if for literals {v,e), {v,e') 
occurring in F with e ^ s' we have h{e) ^ h{e'). If \Dy,\ > :^y(F), then there is 
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always some h : Dy — >■ such that replacing v hy w in F using h is injective. For 
every injective h, replacing v hy w in F using h is injective. Note that injective 
replacements alter the "meaning" (the set of models modulo isomorphism) exactly 
in the case where a non-pure variable (a variable such that all values occur in 
F; see Subsection 1.6) is rendered a pure variable by using a domain Du, with 
\Dw\ > ^^viF). Special injective replacements are renamings, where his a bijection 
from Dy to Dy,. If we have a renaming of u by it; using h in F, resulting in F', 
then we also have the renaming of w to w using in F' , resulting in F. So 
the satisfying assignments for F' here are exactly the satisfying assignments for F 
where v is replaced by w using h. If several variables are renamed simultaneously, 
then we require that the same result is obtained by renaming single variables one 
after another (in some order). 

For a multi-clause-set F a domain-uniformisation of F is for var(F) = 0, 
while otherwise we consider the domain D := U«evai(_F) rename all variables v 
in F to v' with domain Z3, and add all unit clauses {v' 7^ e} for e G -D \ Dy. 



1.3.4 Conflict structure 

A study of the "combinatorics of conflicts" for boolean clause-sets has been initiated 
with ^ and continued with ^ . We generalise here only a very few simple 
notions used later in this article. 

The conflict graph cg(F) of a clause-set F G A4CCS has as vertices the 
clauses of F, and edges joining two vertices C, D with a clashing literal between 
C and D, that is, there is a literal a; G C for which there exists a literal y £ D with 
var(a;) = var(j/) and x ^ y. While the conflict graph does not take multiple clashing 
literals into account, the conflict multigraph cmg(i^) contains as many parallel 
edges between two clauses as there are conflicts. A clause-set F is called a hitting 
clause-set if the conflict graph of F is a complete graph, and the hitting degree 
hd(i^) G N of a hitting clause-set with at least two clauses is the maximum of the 
number of (parallel) edges joining two different vertices in the conflict multigraph 
of F. More specifically we call F an r-regular hitting clause-set for r G Nq 
if every two different clauses in F have exactly r conflicts (thus if F is r-regular 
hitting for r > 1, then F is hitting), while a regular hitting clause-set is an 
r-regular hitting clause-set for some r > 0, and we denote the set of regular hitting 
clause-sets by TiTiXTPl 



1.3.5 Three operations of sets of variables on multi-clause- 
sets 

Finally we consider various operations with sets of variables. The operation V * F 
is defined for finite V CVA and F G MCCS via 

(F*F)(C):= ^(^')' 

c'ecc 
v,c'=c 

where for a clause C we set F * C := {2; G C : var(x) ^ V} G CC. That is, V * F 
is obtained from F by crossing out all literal occurrences x with var(x) G V . Two 

Regular hitting clause-sets were called "uniform hitting clause-sets" in jjo] , but "regular" 
seems a better choice, since regularity refers to constant degree, while uniformity typically refers 
to constant hyperedge-size, and so should better be used for clause-sets of constant clause-size. 
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basic properties are 



var(y * F) 
c{V * F) 



var(i^) \ V 
c{F). 



The operation V * for F G CCS is defined by 

y*F := t(y *t(F)) eccs. 

We have here F * F = {y * C : C G F}. The basic laws for F, Fi, F2 G MCCS and 
finite V, V C are 



Again, the first three laws also hold for the operation of sets of variables on clause- 
sets, while for Fi, F2 G CCS we have 



(again this does not hold for multi-clause-sets in general). 

We conclude by discussing the two basic ways of selecting parts of a multi- 
clause-set. By F\r we denote the induced sub- multi-clause-set of F with C G Fy <^ 
var(C) n y 7^ 0; in other words, Fy = F \ {C G F : var(C) n = 0}. Basic 
properties are: 

1. F0 = TandF,,,(f.)=F\{±}. 

2. If Vi C V2, then Fy^ is an induced sub-multi-clause-set of Fyj. 

3. Fy^uV2 — U Fv2- 

4. For V eVAwe have c(F{„}) = #„(F). 

Additionally restricting the variables is achieved by 

F[V] (var(F) \V) * Fy ^ ((var(F) \V)*F)\ {_L} G MCCS, 

which, in analogy to the same process for hypergraphs (using usually also the same 
notation, see for example can be considered as the "restriction of F to V" . 

Basis properties are 

1. F[0] = T and F[var(F)] ^ F\ {_L}. 

2. c{F[V]) = c(Fy), var(F[l/]) C var(Fv). 

3. var(F[y]) = V tor V C var(F). 

To summarise: We obtain V * F from F by keeping all clauses but removing those 
literals x from them with var(a;) G V , while we obtain Fy from F by removing those 
clauses C from F with var(C) ("1^ = (while keeping all clauses intact); finally 
F[V] is obtained from F by first constructing Fy, and then crossing out all literal 
occurrences for literals x where there exists a clause C ^ F with var(C) OV — ^ 
and var(a;) G var(C). 

F[V] is the formula derived from F when we want to consider total assignments 
relative to the variable set V, and is basic for the theory of autarkies reviewed in 
the subsequent section, while V * F and Fy are fundamental constructions. An 
example for these operations: 



* F = F 



{V(JV')*F 'i'iV' ^ F) 

* (Fi + F2) = y * Fi + y * F2. 



y * (Fi u F2) = y * Fi u y * F2 
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1. Consider boolean variables a,b,c (the domains of variables do not matter). 

2. Let 

(a) Ci :={(a,0),(6,l),(c,0)}, 

(b) C2 :={(a,0),(6,0),(c,l)}, 

(c) C3 :={(a,l),(6,0),(c,l)}, 

(d) C4:={(6,l),(c,l)}. 

3. Let F :— F corresponds to the boolean CNF 

(a V 6 V c) A (a V 6 V c) A (a V 5 V c) A (5 V c). 

4. Now we have 

(a) i^{a} = 

(b) {a} * F = {{(6, 1), (c, 0)}} + 2 . {{(6, 0), (c, 1)}} + {{{b, 1), (c, 1)}}, 

(c) F[{a}] = 2.{{(a,0)}} + {{(a,l)}}. 



1.4 Autarkies for generalised multi-clause-sets 

Of central importance to our work is the notion of autarkies, such partial assign- 
ments which satisfy parts of the formula and leave the rest untouched. Autarky 
systems allow to tailor the notion of autarkies to special purposes. We review here 
the general properties of autarkies and autarky systems for generalised multi-clause- 
sets. See Section 3 in P6| for a general theory of autarkies and autarky systems, 
while in Section 4 of p6 autarky systems for generalised clause-sets have been 
discussed (easily generalised to autarky systems for generalised multi-clause-sets). 
General properties of autarkies for boolean clause-sets are thoroughly investigated in 
p5| , Section 3, while autarky systems for boolean clause-sets have been introduced 
in (see Sections 4 and 8 there for the general theory). For boolean clause-sets, 
the handbook chapter [ p5| gives an overview on autarky theory, while the textbook 
[7l| introduces basic properties of autarkies. 

1.4.1 The notion of an autarky 

A partial assignment (p G TIASS is an autarky for F e AiCCS if one (and thus all) 
of the following four equivalent conditions is fulfilled: 

1. for all clauses C G F we have var((p) n var(C) 7^ * {C} = T; 

2. VF' <F -.tp^F' <F'; 

3. 93 is a satisfying assignment for F'var((^); 

4. 93 is a satisfying assignment for F[var((y5)]. 

Obviously, ip is an autarky for F iff is an autarky for F\ {±} iff is an autarky for 
the underlying clause-set. The set of all autarkies for F is denoted by Auk(F); it 
is Auk(F) a sub-monoid of TiASS, containing all satisfying assignments for F, and 
Auk(F) = Auk(t(F)). If F' < F, then Auk(F) C Auk(F'), and for finite V CVA 
we have 

{ip e Auk(V^ * F) : var{ip) nV = lll} = {ip e Auk(F) : yar{ip) n = 0}, 
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that is, the autarkies for F which do not use variables from V are exactly the 
autarkies for V * F (which do not use V). Furthermore we have 

Auk(Fi + F2) = Auk(Fi) n Auk(F2). 

Hip ^ Auk(F) and ip g Auk((^ * F), then tpoip e Auk(F). An autarky ip e Auk(i^) 
is called non-trivial if var((p) n var(F) ^ holds. F is called lean, if F has no non- 
trivial autarky; the set of all lean multi-clause-sets is denoted by CSAAf. Every 
F having only variables with trivial domain (i.e., one-element domains) is lean. A 
sum of lean multi-clause-sets again is lean. If F is lean, so is F * for V C VA. 

An autarky reduction is a reduction F t-^ (p * F for some non-trivial autarky 
(p for F (note that p * F is satisfiability equivalent to F). Autarky reduction is 
terminating and confluent (generalising Lemma 4.1 in |47| , a special case of Lemma 
3.7 in p6|), and thus the result of iterated autarky reductions until no further 
reductions are possible is uniquely determined; we denote it by ^AukiF) < F. 
The operator Na := NAuk is a "kernel operator", that is, Na(F) < F, Na(Na(F)) = 
Na(F), and Fi < F2 =^ Na(Fi) < Na(i^2); furthermore Na(F) is satisfiability 
equivalent to F, and Na(F) = T iff F e SAT. We have Na(i^) G CSAAf, and 
Na(-F) is called the lean kernel of F; F is lean iff Na(F) = F. There exists an 
autarky p G Auk(F) with Na(F) = p ^ F (while for all p G Auk(F) we have 
Na(-F) < p * F). Na(i^) is the largest lean sub-multi-clause-set of F. 

An autark sub-multi-clause-set of F is an induced sub-multi-clause-set F' 
of F, such that there exists an autarky p G Auk(i^) so that for C G F we have 
C & F' iS (p * {C} = T (note that in this case we have F' = Fvar(i^)). The set of 
autark sub-multi-clause-sets of F is closed under union, and contains the smallest 
element T and the largest element F \ Na(F). A fundamental observation is that 
_F' < F is an autark sub-multi-clause-set of F iff there is y C var(F) with Fy = F' 
and F[V] G SAT. 

The relation between the lean kernel of F and the largest autark sub-multi- 
clause-set of F can be summarised as follows: For F G MCCS there exist in- 
duced sub-multi-clause-sets Fi,F2 < F with Fi -I- F2 = F, such that Fi is lean, 
while var(Fi) * F2 is satisfiable; in this decomposition Fi,F2 are uniquely de- 
termined, namely Fi = Na(F) is the largest lean sub- multi-clause-set (the lean 
kernel), while F2 is the largest autark sub- multi-clause-set; furthermore we have 
F2 = ^'var(F)\var(Fi), and thus var(Fi) * F2 = F[var(F) \ var(Fi)]. 

For an example consider variables a, b, c, d with Da = = {0, 1, 2} and Dc = 
Dd = {0, 1}, and consider the clause-set 

F Fi U F2 

Fi {{a^O,MO}, {a7^0,6y^l}, {a^0,M2}, {a^l}, {a^2}} 

F2 {{a^O,c^O,d^l},{b^O,c^l,d^O}}. 

To see whether there is an autarky for F invoking exactly one variable we check 
satisfiability of F[{w}] for v G {a,b,c,d}; we see that all these clause-sets are un- 
satisfiable (e.g., F[{c}] = {{c 0}, {c ^ 1}}), and so the smallest non-trivial 
autarky for F (if there is any) must involve at least two variables. Now F[{c, d}] = 
{{c ^ 0, d 7^ 1}, {c ^ 1, d 7^ 0}} G SAT, and thus the two partial assignments 
(c, c? 0) , (c, d — )■ 1) are autarkies for F; applying any of them yields Fi, which is 
lean (Fi actually is minimally unsatisfiable), and thus Fi is the lean kernel of F, 
while F2 is the largest autark sub-clause-set of F. 
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1.4.2 Autarky systems 



After having reviewed the general lacts for autarkies for generahsed multi-clause- 
sets, we now consider "autarky systems" . The motivation is that instead of (com- 
putationally hard) general autarkies we want to consider more feasible, restricted 
notions of autarkies. Under mild assumptions on these restricted autarkies all the 
above facts carry over (in generalised form). A definition of "autarky systems" for 
abstract "systems with partial instantiation" has been given in |Q; the monoid 
{TIASS, o, 0) together with the partial order {A4CCS, <, T) with least element and 
together with the operation * of VASS on MCCS fulfils all the axioms required in 
Section 3 of [Q, and thus all the general results there on autarky systems can be 
carried over. Again we refer to the handbook chapter psf for more information. 

An autarky system for generalised multi-clause-sets is a map A, which assigns 
to every F G MCjCS a sub- monoid A{F) of Auk(_F), such that for Fi < F2 we have 
A{F2) C ^(Fi). The elements of A{F) are called ^l-autarkies for F. Further 
possible restrictions on A are expressed by the following notions: 



1. ^ is iterative, if for (p G A{F) and tp G A{if*F) we always have iljo(p g A{F). 

2. ^ is called standardised, if for a partial assignment (p G TASS we have 
(fi G A{F) iff (fi I var(F) G A{F) (where ip \ var(F) is the restriction of the map 
ip to the domain var(<p) H var(_F)). (Remark: Thus for a standardised autarky 
system A all partial assignments tp with yslt{p) D var(i^) = are (trivial) A- 
autarkies for F. In Q only the direction "(^ G A{F) p \ var{F) G A{F) 
was required, but now it seems more systematic to me to require also the 
other direction.) 

3. A is ±-invariant, if always A{F) = A{F + {±}) holds (in 0, |^ this was 
called "normal"). 

4. A is invariant under variable elimination, if for finite V C VA we always 
have {p G A{V * F) : vaiip) DV ^ii} ^ {p e A{F) : var((^) n = 0}. 

5. ^ is invariant under renaming, if for every F' obtained from F by renaming 

V to w using h (recall Subsection 1.3.5 ) and for every autarky p G A{F) we 
have p>' G A{F') for the partial assignment ip' obtained from p> by renaming 

V to w using h. 

6. A is stable for unused values, if for p G A{F), v G dom{p) and for e G Dy 
such that none of the two literals {v, p{v)), {v, e) occurs in F, also p' G A{F) 
holds, where p' is obtained from p by hipping v to s. 



An autarky system A is called normal, if it fulfils these six criteria, that is, if it is 
iterative, standardised, _L-invariant, invariant under variable elimination, invariant 
under renaming and stable for unused values. Considering the boolean case (where 
stability for unused values is covered by the standardisation condition, while in- 
variance under renaming was not considered), in [^6[ "normal autarky systems" 
were called "strong autarky systems" , but meanwhile the above properties turned 
out not to be so strong but quite "normal" , while "ab-normality" is a defect which 
can be repaired; see for example Lemma 8.4 in ^7\ , which can be generalised to 
generalised clause-sets. Examples for normal autarky systems are the smallest stan- 
dardised autarky system F G MCCS ^ {p e VASS : var((y9) n var(F) = 0} and the 
largest autarky system F G AiCCS 1— > Auk(F). In this article our main interest is 
in normal autarky systems, and we don't investigate the detailed relations between 
the above notions and the other properties of autarky systems, but we will state 
general results either for all autarky systems or for all normal autarky systems. 

Consider an autarky system A. 
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• An ^-reduction is a reduction F tp * F ior some non-trivial (p £ A{F). 

• Since multi-clause-sets have finite variable sets, ^-reduction is terminating, 
and thus by Lemma 3.7 in y^-reduction is confluent, and the result of 
applying y^-reductions as long as possible is uniquely determined, yielding a 
normal form Nx(-F) < F. As before, the operator is a kernel operator, 
that is 

1. Na{F) < F, 

2. N^(N^(F)) - ^a{F), 

3. Fi<F2^ Na{Fi) < N^(i^2). 

• Multi-clause-sets F with N_4(F) = T are called ^-satisfiable, while in case 
of Na{F) — F we call F ^-lean; the set of all ^-satisfiable multi-clause-sets 
is denoted by SAT^a, the set of all y^-lean multi-clause-sets by CS-AMjs^. 

• F is yl-lean iff AiF) contains no non-trivial autarky. The learn kernel N_4(F) 
is the largest ^-lean sub-multi-clause-set of F . A sum of ^-lean multi-clause- 
sets again is ^-lean. 

• Finally F is called minimally ^-unsatisfiable, if F is not ^-satisfiable, but 
every F' < F with F' F is A-satisfiable, while F is called barely ^-lean if 
F is ^-lean, but every F' < F with c{F') = c{F) — 1 is not A-lean; for more 
on these two notions see [^3| , while in this article we will consider only some 
basic properties of minimal ^-unsatisfiability. 

• If F is minimally ^-unsatisfiable, then F is ^-lean, and for F ^ {±} it 
is F also barely ^-lean. If A is the full autarky system, then minimal A- 
unsatisfiability is just (standard) minimal unsatisfiability. 

For the remainder of this subsection now assume that the autarky system A is 
normal. Then F is ^-satisfiable iff there exists ip G A{F) with (p * F = T. More 
generally, there always exists ip G A{F) with (p*F — N^(F). If F is ^-lean, then so 
is V * F and F[V] for finite V C VA. The ^-autark sub- multi-clause-sets of F, i.e., 
those multi-clause-sets F' where there is (p £ A{F) with F' — -F'var(<p), are exactly 
those Fv for some V C var(F) where F[V] is ^-satisfiable. The set of ^-autark 
sub-multi-clause-sets of F is closed under union, and contains the smallest element 
T and the largest element F \ N^(F). As before, the relation between the ^-lean 
kernel of F and the largest ^-autark sub-multi-clause-set of F can be summarised 
as follows: 

Lemma 1.4.1 For F £ A4CCS there exist induced sub-multi-clause-sets Fi,F2 < 
F with Fi + F2 = F , such that Fi is A-lean, while var(Fi) * F2 is A-satisfiable. 
In this decomposition, Fi , F2 are uniquely determined, namely Fi = (F) is the 
largest A-lean sub-multi- clause- set (the A-lean kernel), while F2 is the largest A- 
autark sub-multi-clause-set. Furthermore we have F2 = F'var(_F)\var(Fi)7 '"^'^ thus 
F' := vsLi{Fi)*F2 — F[var(F)\var(Fi)]; the multi-clause-set F' is A-satisfiable, and 
every A-satisfying assignment p> for F' ( with var((y9) C var(F') = var(_F2) \ var(Fi) ) 
is an A-autarky for F with p * F — Fi and Fvar(ip) — ^2. 

We finish our review on autarkies and autarky systems by generalising Lemma 
8.6 in The proof can be literally transferred to our generalised context, and 

thus is not reproduced here. 



22 



Lemma 1.4.2 Let A be a normal autarky system. Given decision of membership 
in CEAMa o,s an oracle, the normal form F i— Na,{F) for F G AiCCS can be 
computed in polynomial time as follows: 

1. IfFe CEAMa then output F. 

2. Lei var(F) = {di, . . . ,w„(F)}. 

3. Since {) * F = F ^ CEANa and vai{F) *F = c{F) ■ t({_L}) G CEAJ^a holds, 
there is an index I < i < n{F) with 

{vi, . . . ,Vi^i} * F ^ CEANa and {vi, . . . ,Vi\ * F e CEANa- 

Replace F by the induced sub-multi-clause-set of F given by the clauses of F 
not containing variable Vi, and go to Step\^ 

The output of this procedure is Na(-F). If we use V for the set of variables Vi selected 
in Step 1^ then there exists an autarky Lp for F with var(i^) = V , and Fv is the 
largest autark subset of F (note F is the original input). Thus an autarky ip for 
F with if * F — Na(i^) is obtained by an A-satisfying assignment ip for F[V] (with 
var((p) QV). 



The idea behind the algorithm of Lemma L4.2 is that we want to find a variable v 



such that there exists an autarky Lp for F with v G var(iy9); if there is no such vari- 
able, then F is lean while otherwise we can eliminate all clauses from F containing 
variable v. Now the variable Vi selected in Step |^ must be such a variable: Consider 
a non-trivial autarky ipi for Fi :— {vi, . . . * F with var((/3i) C var(i^i). Since 

{vi\ * Fi is lean, Vi G var((y9i) must be the case, while ^pi is an autarky also for F . 
For more on such reductions (for boolean clause-sets and the full autarky system) 
see |65[ , where it is also shown, given an oracle for .A- leanness decision, how to 
find a non-trivial autarky (if existent), which through iteration yields an autarky 
tf G A{F) with (f * F = N^(F). This works via addition of unit-clauses, similar 
to the usual self-reduction, but now used together with crossing out of variables as 
above, obtaining a minimal non-trivial autarky (if existent). 



1.5 Resolution 



In this sectio n we d iscuss the fundaments of resolution for generalised clause-sets. 
In Subsection 1.5.l| the obvious generalisation of the boolean resolution rule is dis- 
cussed, deriving a (generalised) resolvent-clause from (generalised) pa rent-c lauses. 
The duality between resolution and autarkies is presented in Subsection 1.5.2, show- 
ing that the clauses satisfiable by some autarky are exactly those not usable in any 
resolution refutation. Finally in Subsection 1.5.3| the special case of "Davis-Putnam 
resolution" is considered, due to its importance for reductions. 



1.5.1 The resolution rule 

Let us consider for the moment the most general form of (variable-based) CNF- 
clause-sets, that is, negative signed clause-sets (recall the discussion in Subsection 
|1.1.3| ), where the literals are of the form "w ^ S"' for some S C (so literals are 
unary constraints, but presented by their sets of falsifying values)]^ Representing 

^*^If only one sign is present ("monosigned"), then for CNF the negative sign is appropriate, and 
for DNF the positive sign, since CNFs exclude total assignments, while DNFs mclude them. 
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literals "v ^ S" by pairs {v, S), the most general resolution rule allows for given neg- 
ative signed clauses Ci, . . . , Cm (w > 0) with Xi = (f , Si) £ Ci for « G {1, . . . , m} 
the formation of the resolvent R := (Uie{i,...,m} '^^ \ i^J) ^ Uje{i,....m} 
a negative signed clause collecting all literals from the Ci except of the Xi, and 
including additionally the literal on variable v whose (negative) sign is the union 
of the (negative) signs of the Xi; note that (w,0) represents truth value 1 (is always 
true), while {v,Dv) represents truth value (is always false). This most powerful 
resolution rule captures immediately the power of resolution on the direct boolean 
translation (which needs to use both ALO and AMO clauses here), and in fa ct the 
latter can also polynomially simulate the former. As discussed in Subsection |l.l.3| , 
also when starting with (just) generalised clause-sets, using the general resolution 
rule for signed clause-sets can yield polynomial-size refutations where ordinary res- 
olution on generalised clause-sets (as discussed in this section), which stays inside 
the class of generalised clause-sets, has only exponential-size refutations. However 
in this article we consider resolution mainly as a "combinatorial tool" , and then 
(restricted) resolution for generalised clause-sets is appropriate. 

The resolution rule for generalised clause-sets is well-known. A thorough study is 
approached in , where actually resolution is considered for general "fipa-systems" 
(systems with finite instantiation by partial assignments), by reducing resolution 
for such axiomatic systems to resolution for generalised clause-sets through the use 
of "no-goods", i.e., out of the general system we get the clauses C belonging to 
the resolution refutation as clauses C^ associated with such partial assignments 
Lp which led to the contradictions. We now review the most basic notions, with 
special emphasise on the fundamental duality with the notion of autarkies. A 
technical comment: For autarky systems the number of occurrences of a clause in 
a multi-clause-set might make a difference (as it is the case for matching autarkies 
introduced in the Section p^.9| ), however for all known resolution systems we do 
not need this distinction, and thus only (generalised) clause-sets are considered for 
resolution (that is, if multi-clause-sets F € MCCS are to be treated, then they are 
automatically "downcast" to the underlying clause-set t(F)). 

Consider a variable v e VA. "Parent clauses" Ci, . . . , C\£,^\ are called resolv- 
able with resolution variable u, if vali,({Ci, . . . tC\u^\]) = Dy and the resol- 
vent R := * actually is a clause (contains no clashing literals), that is, 
whenever there are literals x G Ci, y ^ Cj for some i, j € {1, . . . , \Dy\} with x ^ y 
and var(a;) = var(2/), then var(a;) — v must be the case. Resolution is a complete 
and sound refutation system; see for example Corollary 5.9 in |5^, where, translat- 
ing branching trees into resolution trees, the existence of a resolution tree with at 
most (max„gvar(F)|^ii|)"'^'^^ many leaves for unsatisfiable generalised clause-sets F 
is shown. Also stated in is the (well-known) "strong completeness" of resolu- 
tion, that is, for a multi-clause-set F € MCCS and a clause C G CC we have F \= C 
iff there exists a resolution tree with axioms from F deriving a clause C C C . 



1.5.2 The duality between resolution and autarkies 

In Theorem 3.16 in |^ the "resolution-autarky duality theorem" was shown for 
boolean clause-sets, namely that the lean kernel of a clause-set F consists exactly 
of all clauses C G F which can be used in some resolution refutation of i^.^ A 



predecessor is |25|] , where it was shown that when a tableau-calculus based (boolean) 
SAT solver gets "stuck", i.e., can not find a refutation based on the current top- 
clause, then an autarky satisfying this top-clause can be extracted, while clauses 



l5)\YJiere the resolution refutation may not contain "dead ends", which can be most easily 
enforced by considering only resolution trees. 
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satisfied by some autarky can not participate in any tableau- refutation.^ In p6[ [, 
Theorem 4.1, the generalisation of the duality-theorem to generalised clause-sets 
is stated, but without a proof, which we now outline. Consider the set U{F) of 
clauses C G F for which there exists a tree resolution refutation of F using C as 
an axiom. The direction, that a clause C G F \ Na(-F) can not be used in tree 
resolution refutations of F (i.e., U{F) C Na(F)), is easily proved by induction: an 
autarky of F satisfying C is an autarky for all clauses in the tree and satisfies all 
clauses derived from C. For the reverse direction the main technical lemma is, that 
for each variable v G var(J7(F)) and each e € Dy the unit-clause {(«,£)} can be 
derived from U{F) by resolution (this is a little proof-theoretic exercise; see Lemma 
3.14 in [|5| for the boolean case). Now it follows that F \ U{F) is an autark sub- 
clause-set of F, since if the clause-set var([/(F)) * {F\U{F)) would be unsatisfiable, 
then there would be a tree resolution refutation T of var({/(i^)) * (i^ \ U{F)), where 
the axioms of T could be derived from the clauses in F \ U{F) and the clauses in 
U{F) by the above technical lemma, and thus we could construct a tree resolution 
refutation involving some clause of \ U{F), contradicting the definition of U{F) 
(compare with Lemma 3.15 in |^ for the boolean case). That F\U{F) is an autark 
sub-clause-set of F means Na(F) C U{F), and altogether we have shown: 

Theorem 1.5.1 For any generalised clause-set F G CCS the lean kernel Na(F) 
equals the set U{F) of clauses of F usable in some (tree) resolution refutation of 
F. So F is lean if and only if F ^ U{F), that is, if every clause of F can he used 
in some (tree) resolution refutation of F. 



As shown in Section 6 of |4q]. Theorem 1.5.1 yields an algorithm for computing 



Na(F) by using "intelligent backtracking solvers", which on unsatisfiable instances 
can return the set of variables used in some resolution refutation of the input. 
Crossing out these variables from the input, removing the empty clause obtained, 
and repeating this process, we finally obtain a satisfiable clause-set F*, and now 
any satisfying assignment (p for F* with vai{Lp) C var(F*) is an autarky for F with 
if * F — F \ Na(i^). See for more details on this computation of the lean kernel 
(in only boolean clause-sets are considered, but based on the results of the 
present article, all (mathematical) results can be generalised in the natural way). 

Corollary 1.5.2 Consider a class C C CCS of generalised clause-sets, such that for 
all unsatisfiable F d C and all V C VA we have F[V] G C. Assume furthermore that 
there is an algorithm running for inputs F G C in polynomial time, which either 
computes a satisfying assignment for F or computes the set of variables used in 
some tree resolution refutation of F. Then for inputs F £C the lean kernel Na(i^) 
is computable in polynomial time. 



A general source of classes C as required in Corollary 1.5.2 are the classes 
QkiU,S) for levels fc G No and suitable oracles U for unsatisfiability and C for 
satisfiability, as introduced in In this way for example we get poly-time com- 
putation of the lean kernel for generalised Horn clause-sets (allowing non-boolean 
variables as well as considering higher levels k of "Horn-structures"). For further 
information on algorithmic problems related to autarkies see [p5|. 



1.5.3 DP-resolution 



We conclude this section by defining the Davis-Putnam operator DP for gener- 
alised clause-sets. Consider a clause-set F G CCS and a variable v G var(i^). Let 

^^^We remark that [tsI extends this approach by parallelisation, exploiting the observation (made 
there first) that the combination of autarkies yields again an autarky. 
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Fy be the set of all resolvents of parent clauses in F with resolution variable v. Now 
we set DP„(F) := {C G F : v ^ var(C)} U Fy. From the completeness results for 
(generalised) resolution in |Q it follows immediately that DPy{F) is satisfiability 
equivalent to F, and that F is unsatisfiable if and only if by repeated applications 
of the Davis-Putnam operator we finally obtain the clause-set {_L} (while for satis- 
fiable F finally we will obtain the clause-set T). We can also generalise Lemma 7.6 
in [ |63[ about the commutativity of the Davis-Putnam operator, that is, if Gi is the 
result of applying first DPvi, then DP^^, and finally applying DP^,^^, while G2 is 
the result of applying first DP^^^^^^ , then DPy^^^^, and finally applying DP^^^^^^^ , 
for some permutation tt e Sm , then after elimination of subsumed clauses in Gi and 
G2 (see the following subsection) Gi becomes equal to 6*2- It follows that for any 
set of variables V the operator DPy , computed by running through the variables of 

V in some order, is uniquely determined up to subsumption reduction in the result. 
We always have DPy(F) = DPy(Fy) U{F\Fv). If for some V C var(F) we have 
DPv{Fv) — T, then F and F \ Fy are satisfiability equivalent, generalising the 
elimination of autark sub-clause-sets: If e Auk(_F'), then P>PYa.r(ip){Fvai(ip)) = T, 
while the reverse direction need not hold, as the example F = {{u, a}, {iJ, a}} U F', 

V ^ var(F'), with V = {v} shows (for boolean variables). We see that the Davis- 
Putnam operator, whose application for generalised clause-sets is semantically the 
same as existential quantification (that is, the results are logically equivalent), yields 
more powerful reductions than autarky reduction, but this at the cost of potential 
exponential space usage. 



1.6 Reductions 

In this section we review the basic polynomial-time reduction concepts. For a 
thorough discussion in the boolean case, see [Q. We consider only clause-sets 
F G CCS., but all results are easily generalised to multi-clause-sets. In Subsection 
1.6.1 the most basic reductions are considered, and in Subsection 1.6.2| reductions 



by means of DP-resolution are presented. 



1.6.1 Basic reductions 

The most basic reduction (by which we mean a satisfiability-equivalent transforma- 
tion, simplifying the clause-set in some sense) is subsumption elimination, the 
elimination of subsumed clauses, i.e., the transition F F \ {C} ior G G F in 
case there exists G' G F with C" C G. Iterated elimination of subsumed clauses 
is confluent, and thus the result of applying subsumption elimination as long as 
possible is uniquely determined (namely it is the set of all minimal clauses of F) ; if 
F has no subsumed clauses, then we call F subsumption- free. 

The next reduction can be called the trivial-domain reduction: If there exists 
V G var(F) with \Dy\ — 1 (we call such a variable trivial), then for Dy — {e} 
reduce F {v ^ s) * F (that is, all literal occurrences with underlying variable v 
are removed). 

Elimination of "pure literals" is now better called elimination of pure vari- 
ables: If there is u G var(F) with |val^(F)| < \Dy\, that is, one of the values of v is 
not used in F, then for some e S Dy \ valy{F) reduce F (u — e) * F. This is the 
basic form of a pure autarky as mentioned in Subsection 4.4 of p6[ . 

Unit-clause elimination for generalised clause-sets is less powerful than in the 
boolean case: If F contains a unit-clause {{v,e)} G F, then in case of Dy = {e} by 
trivial-domain reduction we conclude that F is unsatisfiable, but otherwise we can 
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only conclude that value e is to be excluded from the domain of v, and in general we 
can not eliminate the variable v. It seems most convenient here to include trivial- 
domain reduction into unit-clause elimination (so that we properly generalise the 
boolean case); in our context, where we fixed the domain of each variable (and thus 
renaming is needed to achieve a change of domain), unit-clause propagation for 
(generalised) clause-sets F is then the following procedure: 

1. Apply trivial-domain reduction to F to eliminate all trivial variables. 

2. If ± e F, then reduce F to 

3. For {{v,e)} 6 F eliminate all clauses containing the literal {v,e) from F, and 
replace variable v in the remaining occurrences by a new variable v' with 
D,, - D, \ {e} ^ 0. 

4. Repeat Steps |l| - § until all trivial variables and all unit-clauses have been 
eliminated. 

So after unit-clause propagation every variable has a domain with at least two 
elements and, except of the case F — {!.}, every clause contains at least two literals. 
Unit-clause propagation for boolean clause-sets is confluent (the final result does 
not depend on the order and choice of single reduction steps), while unit-clause 
propagation for arbitrary clause-sets is confluent modulo renaming. Generalising 
the well-known linear time algorithm for unit-clause propagation in the boolean 
case, this normal form (the result of unit-clause propagation) can be computed in 
linear time. 

Considering clauses C G CC as constraints of scope var(C) (see Subsection 
2.1.1), and thus clause-sets F e CCS as constraint networks (or constraint satis- 
faction problems), the basic reduction concepts for constraint satisfaction problems 
are translated as follows: 

1. F is fc-regular (in natural generalisation of Definition 5.14 in [||) iff for every 
variable set V C var(F) with \V\ < k there is at most one clause C € F with 
var(C) = V. Due to the restricted power of (generalised) clauses, this notion 
seems to be not of great relevance here (on the other side, it might make sense 
to comprise binary clauses and possibly also ternary clauses with the same 
scope into constraints). 

2. is relational arc-consistent (every assignment to all but one of the variables 
of a clause C <E F can be extended to a satisfying assignment for C; see ]l6[| . 
Definition 8.1) iff F does not have trivial variables. 

3. F is (hyper-)arc-consistent (every assignment to one variable of a clause C £ F 
can be extended to a satisfying assignment for C; see ||l^. Definition 3.6) iff 
for all C e F \ {±} after trivial-domain reduction at least two literals are left. 

So unit-clause propagation achieves both hyper-arc-consistency and relational arc- 
consistency. More information on the relations to the basic reduction concepts for 
constraint problems one finds in Subsection 3.3 of [^5|, while more extensive studies 
have been performed by David Mitchell. 

1.6.2 DP-reductions 

Finally we consider the most harmless cases for DP-reductions. In general, appli- 
cation of DPt, to F eliminates #«(F) = X^eg^i^ #(d,e)(-F) clauses and creates up to 
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neGi? clauses (with potential repetitions; less iff some of the parent 

clause combinations are not eligible for resolution due to additional clashes). Thus 
we have 

ciBF^F)) < c(F) - J2 + n #(v,e){F)- (1-1) 

eeD^ eeD^ 



Note that equality holds in (1.1) if w is pure for F (which is equivalent to the product 
being zero). If in (^]^) we have a strict inequality or u is a pure variable for 
then we call v a degenerated DP-variable w.r.t. F, while otherwise v is called a 
non-degenerated DP-variable vif.r.t. F. Note that a missing new clause due to 
additional clashes is not the only cause of a strict inequality, but it is also possible 
that a resolvent is already contained in the rest of F^ or that two resolvents coincide 
(and thus in both cases contraction occurs). Two trivial cases: 

1. If variable v G var(F) has a trivial domain (i.e., \Dj;\ = 1), then either we 
have a subsumption C,C U {{v,e)} S F (for some clause C not containing 
v), in which case w is a degenerated DP-variable w.r.t. F, or otherwise v is 
a non-degenerated DP-variable with c(DPy{F)) = c{F), and in both cases 
DP^(F) is the result of applying trivial domain reduction to F. 

2. If V is pure w.r.t. F, then F is a degenerated DP-variable with c{DPy{F)) = 
c{F) — ^y{F), and DP^(f ) is the result of applying the elimination of pure 
variable v to F. 



Besides these cases, in this article we consider only one very restricted form of DP- 
resolution, characterised by the condition that at most one of the factors in the 
product from (1.1) might be greater than one: 

We call a variable v a singular variable w.r.t. F if there exists e e Dy such 
that for all e' £ Dy \ {e} we have #{y.e'){F) — 1, while #{v,e){F) > Ij i-e., v is 
not pure for F. In such a case of a singular variable, application of DP« eliminates 
I-D^l — 1 + #{y,e){F) clauses and creates up to #{v,e){F) new clauses, so that the 
number of clauses goes down at least by one if \Dy\ ^ 1. For a singular variable v 
we have: 



• If w is a non-degenerated DP-variable w.r.t. F^ then we have c(DP^(F)) — 

c(F) - \Dy\ + l. 

• If t; is a degenerated DP-variable w.r.t. F, then at least one of the clauses in 
F containing v can be eliminated satisfiability-equivalently, and we call such 
a clause elimination a singular DP-degeneration reduction. 

Since a singular DP-degeneration reduction can not be applied to a minimally un- 
satisfiable clause-set, a singular variable w.r.t. a minimally unsatisfiable clause-set 
must be non-degenerated. Actually more can be said here: 

Lemma 1.6.1 Consider a generalised clause-set F G CCS and a singular variable 
V w.r.t. F . Then the following two conditions are equivalent: 

1. F is minimally unsatisfiable. 

2. V is a non-degenerated DP-variable w.r.t. F and DPy{F) is minimally unsat- 
isfiable. 



28 



Proof We have already seen that if F is minimally unsatisfiable, then v is non- 
degenerated. If DPy{F) were not minimally unsatisfiable, then there would be a 
clause C e DP„(i^) such that DPy{F) \ {C} would still be unsatisfiable, and thus 
would have a resolution refutation — now it is easy to see that in this case we would 
also obtain a resolution refutation of F not using one of the clauses in F. 

For the reverse direction assume that v is non-degenerated and that DPt,(F) is 
minimally unsatisfiable. By a similar argumentation as for the other direction, if 
there would be a resolution refutation of F not using one of the clauses from F, 
then one could construct a resolution refutation of DP^,(F) not using (at least) one 
of the clauses from DP^(F). I 

We call application of DP-reduction for non-degenerated singular variables non- 
degenerated singular DP-reduction. In the boolean case, this form of DP- 
reduction is used at many places in the literature (in pi] , Appendix B, it is called 
"(1, oo)-reduction" ); see Lemma 1.11.6 for more on singular DP-reduction. 

We conclude by another reduction arising from the DP-operator. The notion 
of blocked clauses for boolean clause-sets (see can be generalised by 

calling a clause C blocked w.r.t. F if there exists a variable v G var(C) with 
DP„(FU{C}) = DP„(F\{C}). If C e F is blocked w.r.t. F, then F is satisfiability 
equivalent to F \ {C}, and such a reduction is called elimination of blocked 
clauses. If w is a pure variable for F, then all clauses of F containing variable v 
are blocked w.r.t. F. And if w is a degenerated singular variable, then F has at 
least one blocked clause containing v, and so singular DP-degeneration reduction is 
also covered by elimination of blocked clauses. For recent practical applications of 
blocked clauses see [B2[- 



1.7 Matching satisfiability and deficiency 

We now generalise the basic method for satisfying (certain) clause-sets F via match- 
ings, as first systematically studied for the boolean case in |2^. The idea is to 
choose variables Vi so that each clause C G F contains at least one of them, that 
is, there is a literal xc G C with var(xc) = Vi for some i, and so that, whatever 
these associated literal occurrences xc are, there is a partial assignment Vi Si 
such that all literals xc become true — if this can be achieved, then obviously 



we satisfy F. Subsection |1.7.1| elaborates this id ea, int roducing "matching sat- 



isfiable" generalised clause-sets F. In Subsection 1.7.2 we generalise the notion 
of deficiency S{F), which has been introduced for boolean clause-sets F in as 
(5(F) = c(F) —n{F). Using the maximum deficiency ^*(F), the maximal deficiency 
taken over all sub-clause-sets of F, as for boolean clause-sets we obtain the charac- 
terisation of matching satisfiable clause-sets F by the condition 5*{F) ~ 0. Whence 
matching satisfiability can be decided in polynomial time by finding a maximum 
matching, which also yields a satisfying assignment (called a "matching-satisfying 
assignment") in the positive case. Two technical remarks are necessary here: 

1. Matching arguments are sensitive to repetition of clauses, and thus instead 
of just using clause-sets we have to use the more general notion of a multi- 



clause-set (recall Subsection 1.3.1). 



2. In case of a pure variable v € var(F) for some F e M.CCS (that is, not all 
values £ G Dy are used in F) we assume that Dy contains exactly one value 
not used in F (i.e., \Dy\ = |val„(F)| -I- 1). In this way we are not troubled 
anymore by the unknown domain size Dy, but we can measure the size of F 
just by (-{F), while this modification has no influence on any of the notions 
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and procedures in this article (all autarky systems studied here are stable for 
unused values (recall Subsection 1.4.2)). 



1.7.1 Matching satisfiable generalised clause-sets 

We wish to generalise the notion of "matching satisfiable clause-sets" , introduced 
in for boolean clause-sets. Consider a multi-clause-set F together with a de- 
composition F = Fi + ■ ■ ■ + F,n for TO e No and Fi E AACCS, fulfilling the following 
conditions: 

(i) for i G {1, . . • ,to} there are variables Vi € var{Fi) such that for all clauses 
C E Fi we have Vi G var(C); 

(ii) the variables vi, . . . , Vm are pairwise different; 

(iii) for all z G {1, . . . ,to} we have \Dy.\ > |val„. (Fj)!, that is, Vi is a pure variable 
for F,. 

We remark that none of the variables Vi needs to be a pure variable in F. Given 
such a decomposition, we see that F is satisfiable, since for each i there exists 
£i G Dtj. \ vali,;(Fi), and the assignment {vi ^ ei : i E {I, . . . ,to}) is a satisfying 
assignment for F . Considering an arbitrary partial assignment if satisfying F with 
var((^) = {vi, . . . , Wm}, and setting Fi for i £ {!,..., to} as the induced sub-multi- 
clause-set of F given by the clauses C E F with Vi G var(C) and {vi,(p(vi)) ^ C, we 
obviously fulfil the above conditions, and we see that conditions (J) - (|ij) need to be 
restricted so that we can obtain a class of satisfiable clause-sets which is decidable 
in polynomial time. We observe that c{Fi) > jval^. (Fi)| is true for arbitrary multi- 
clause-sets Fi, and thus condition 

([i^' for ah i G {1, ... , to} we have \Dy. \ > c{Fi) 

strengthens condition (pli[). We call multi-clause-sets F G M.CCS having a de- 
composition F = Fi + ■ ■ ■ + Fjn fulfilling conditions (|), ([]) and (pji])' matching 
satisfiable, and the set of all matching satisfiable (generalised) multi-clause-sets is 
denoted by MS AT. 

To understand the connection to matching problems, we introduce the bipartite 
graph B{F) for generalised multi-clause-sets F G A4CCS: 

• F := {{C,i):C <EF,ie{l,...,F{C)}} 

V := {(t;,j) It; Gvar(F), J G {!,..., 17^.1-1} }. 

The elements of F_ are called the clause-nodes, while the elements of V. are 
called the variable-nodes. 

• The vertex set of B{F) is defined as V(B{F)) :— F_\ijy_ with canonical bipar- 
tition {F,V). 

• The edge set E{B{F)) is the set of aU (undirected) edges {(C, i), {v,j)} such 
that V G var(C). 

In other words, the graph B{F) has as vertices i^(C)-many copies of clauses C € F 
together with (|-Dt,| — l)-many copies of variables v G var(i^), while edges connect 
copies of variables v with copies of clauses C such that v G var(C). We remark that 
variables v G var(i^) with trivial domain (i.e., = 1) do not occur in B{F), and 
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that for boolean clause-sets F the graph B{F) is the ordinary (bipartite) clause- 
variable graph. Consider for example the clause-set F = {Ci,C2,Cz] with Ci = 
{(wi,a), (u2,a)}, C2 = {(w2,&), (w3,&)}, C's = {("3, c), (ui, c)}, where \D^.\ = 3. 
Now B{F) is (suppressing the indices for the clause-copies, since here we just have 
a clause-set): 



B{F) 




For a set V of variables we obtain B{V * F) from B{F) by deleting the variable- 
nodes (w,j) of i?(F) with V ^V, while _B(i^[V^]) is the induced subgraph of B{F) 
given by the variable-nodes (w, j) of B{F) with w G together with their neighbours 
(those clause- nodes (C, i) with var(C) n F 7^ 0). Using the weighted number of 
variables 

wn(F):= (l^.l-l)eNo, 

fGvar(F) 

• the number of vertices of B{F) is \V{B{F))\ — c{F) + wn{F), 

• while the number of edges is \E{B{Fj)\ = E.evar(F) *v{F) ■ {\D,\ - 1). 
We have wn(F) = (Et,evar(i=^) ^ ^{P)- If P is boolean, then wn(F) 71(F). 

Lemma 1.7.1 A multi- clause- set F is matching satisfiable iff there exists a match- 
ing in B{F) covering all vertices of F_. 

Proof If F is matching satisfiable, then (using the notations in the definition of 
matching satisfiability above) the clause-nodes corresponding to the clause-occur- 
rences in Fi can all be covered by the variable- nodes belonging to Vi (since c{Fi) 
does not exceed the number of copies of Vi), and altogether we obtain a matching 
covering all clause-nodes. If (for the other direction) we have a matching in B{F) 
covering all vertices of F_, then for each variable v involved in the matching consider 
a sub-multi-clause-set F^ of F corresponding to the clause-nodes connected via the 
matching to the variable-nodes associated with v. These F„ together constitute the 
desired decomposition of F. I 

1.7.2 The deficiency of generalised clause-sets 

Let the deficiency of a (generalised) multi-clause-set F be defined as 

S{F) c{F) ~ wn{F) e Z, 
while the maximal deficiency is defined as 

S*iF) := max S{F') S Nq. 

We have 6*(F) > due to d{T) = 0, and by definition we have S*{F) < c{F). We 
remark that for a domain uniformisation F' of F we have S{F') — S{F) as well as 
S*{F') = S*{F); in principle we could consider only multi-clause-sets with uniform 
domains here, but the advantages in doing so seem to be negligible. 

Considering F' < F as a subset of F, the deficiency S{F') oi F' < F is just the 
deficiency of this subset in B{F) (as we have defined it for arbitrary graphs). By 
matching theory the maximal number of nodes of F coverable by some matching 
thus is c(F) — 6*{F). Summarising we have (generalising Lemma 7.2 in 0): 
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Lemma 1.7.2 Consider a generalised multi- clause- set F G A4CCS. 



1. The maximal size of a matching satisfiable sub-multi-clause-set F' < F is 
c{F')^c[F)-5*{F). 

2. F is matching satisfiable if and only if 6*{F) = 0. 

As an application we can generalise the well-known fact, apparently first men- 
tioned in the literature in [Q, that if a boolean clause-set F has minimal clause- 
length k and maximal variable occurrence k for some fc > 1, then F must be 
satisfiable (see for recent further developments): 

Corollary 1.7.3 Consider a generalised clause-set F G CCS containing a non- 
empty clause. Then 

; — < mm \Dy \ - 1 =^ t G MoAT ■ 

mmce-F|<-^| «Gvar(_F) 

Proof Assume the condition holds and consider F' C F. We must show 5{F') < 0. 
Let d min^,gvai(F)l^ti|- Then S{F') < c{F') — (d ~ l)n(F'), and a sufficient 
condition for S{F') < is < d ~ 1. Let a :— max^gvar(F) H'viF) and h := 

minceF|C|. We know c{F') ■ b < i{F') < n{F') ■ a, and thus < f • ■ 

Since matchings of maximal size can be computed in polynomial time (see Chap- 
ter 16 in [Q), we get the following poly-time results: 

Lemma 1.7.4 For every generalised clause-set F G A4CCS, in polynomial time in 
£{F) we can compute F' < F with F' G MSAT^ such that c{F') is maximal. Since 
F' — F iff F is matching satisfiable, it follows that whether F is matching satisfiable 
is decidable in polynomial time. And due to c{F') = c{F) — 5*{F) furthermore the 
maximal deficiency 5*{F) is also computable in polynomial time. 



1.8 Satisfying assignments versus matching-satis- 
fying assignments 



The theme of this section is the relation between general satisfiability and matching 
satisfiability. From the main result, Theorem 1.8.4, it follows that if a clause-set 



F is satisfiable, then it has a matching satisfiable sub-clause-set F' with at most 
5*{F) less clauses than F, and moreover there is a matching-satisfying assignment 
ipa for F' which can be extended to a satisfying assignment Lp for F using at most 
6*{F) additional variables. Furthermore, every satisfying assignment tp for F can 
be modified (in polynomial time) to become such an extension by means of flips 
of (single) variable assignments such that throughout the whole process we always 
have a satisfying assignment for -F.f^ As a direct application we obtain in Corollary 
1.8.7 that the hierarchy of clause-sets given by the p arame ter 5* allows polynomial- 
time SAT decision for each level. Later, in Theorem ^.3.5 (Part II), we w ill see that 



actually this hierarchy is fixed-parameter tractable, not u sing Theorem 1.8.4 , but 
by combining the structural results from Subsection 1.9.1 with the fix ed-parameter 
tractability of the boolean case. The main use of Theorem 1.8.4 will be given 



^'^'This additional property yields also new information for the boolean case; it is implicitly 
contained in the proofs from |l9| , which are not only generalised here, but also simplified in such 
a way that the construction becomes more lucid. 
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in Theorem 1.10.3, namely how to compute a non-trivial autarky (if existent) in 
polynomial time w.r.t. the maximal deficiency, and this result was not known even 
for the boolean case.P' 



1.8.1 The notion of matching-satisfying assignments 

Consider a (generalised) multi-clause-set F e A4CCS and a partial assignment 
(fi e TIASS. The partial graph B^(F) of B{F) is obtained from B{F) by keeping 
(exactly) all edges {{C,i), {v,j)} where ip satisfies the literal in C with underlying 
variable v (while keeping all vertices). In other words, all edges {{C , i) , {v , j)} 
are eliminated such that for the literal {v, e) G C we either have v ^ va.r{(p) or 
ip{{v,e)) = (i.e., ip{v) = e). So the non-isolated clause-nodes in B^{F) are 
(exactly) the clauses satisfied by Lp, while the isolated variable-nodes in Bip{F) are 
(exactly) the variables in F not used by ip to satisfy any clause. Now p is called 
a matching-satisfying assignment for F if Bip{F) contains a matching covering 
all clause- nodes (thus matching-satisfying assignments are satisfying assignments). 



By Lemma 1.7.l| we get the following basic polynomial-time result, refining Lemma 



1.7.4 by now also considering the (matching-)satisfying assignments. 



Lemma 1.8.1 A generalised multi-clause-set F G A4CCS is matching satisfiable if 
and only if there exists a matching-satisfying assignment for F. If F is matching 
satisfiable, then by computing a maximum matching M in B(F) we can efficiently 
compute a matching-satisfying assignment p for F as follows: The domain of p 
consists of the variables v used in variable-nodes {v,i) covered by M, while p{v) '■= e 
for some (chosen) value e G not used in the occurrences of variable v in the 
clauses corresponding via M to the variable-nodes using v (note that by definition 
there are most |Z?„| — 1 such variable-nodes). 



The following two lemmas give simple basic properties regarding these notions. 
First we consider how many variables need to be used by (matching-) satisfying 
assignments. 

Lemma 1.8.2 Consider a generalised multi- clause- set F G MCCS and a partial 
assignment p G VASS . 



1. If p is satisfying for F, then there exists p' ^ p with n{p') < c{F) such that 
also p' is satisfying for F. (So for satisfying any clause-set we never need to 
use more variables than there are clauses.) 

2. If p is matching-satisfying for F, then there exists p' '^p with n{p') = c{F) 
such that also p' is matching-satisfying for F. (So for a matching satisfiable 
multi- clause- set there is a matching-satisfying assignment using exactly as 
many variables as there are clause- occurrences.) 

3. If p is satisfying for F , and there is no pJ p with n{p') < c{F) such that 
p' is satisfying for F , then p is matching-satisfying for F . 

4. Consider a minimal satisfying assignment p for F w.r.t. the canonical partial 
ordering of partial assignments (that is, there is no p>' d p which still satisfies 
F). Then p is matching-satisfying for F if and only if n{p) — c{F). 



^*'That the lean kernel is computable in polynomial time w.r.t. the maximal deficiency has 
already been shown in [ p^ , but interestingly it was not known how to find a (non-trivial) autarky 
(either just some, or a quasi-maximal one, yielding the lean kernel — both tasks are basically the 
same) . 
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Proof The partial assignment ip' for Part |^ is obtained by removing edges from 
Bip{F) until every clause- node has degree 1, and using then only the variables from 
(p which are still covered. (/?' for Part |2| is obtained in a similar way, only this time 
we remove all edges not contained in some (selected) maximum matching of (F) . 
Part H is shown by Hall's criterion as follows: Assume that there is F' < F such that 
the number jPs (f){F')\ of neighbours of F' in B^{F) is strictly smaller than c(F'), 
and let ip' := ip\ Tb^{f) (F') be the restriction of ip to these neighbours; by definition 
w' is a satisfying assignment for F' with n{(p') < c{F'). Let F" := F—F'. With Part 
ythere is a satisfying assignment Lp" C ip for F" with n{ip") < c{F") — c{F) — c{F'). 
Now let (p* := (f' U ip"; by definition (p* is a satisfying assignment for F with 
^(<y5*) < n{(p')+n{ip") < c{F') + c{F)~c{F') = c{F) contradicting the assumption. 
Finally Part | follows by Parts || and |. I 

Now we show that every sub- assignment of a matching-satisfying assignment is 
also matching-satisfying for a suitable (multi-)clause-set. 

Lemma 1.8.3 Consider a generalised multi-clause-set F G A4CCS and partial as- 
signments (p,ip Q VASS . If (p o ip is matching-satisfying for F , then (p is matching- 
satisfying for ijj * F. 

Proof Let M be a matching in B^o4){F) covering all clause-nodes. The bipartite 
graph -B^(V' * F) is obtained from B^o-ip{F) by removing all clause-nodes satisfied 
by tjj, removing all variable-nodes assigned by ip, and finally removing all variable- 
nodes where the variable does not occur in ip * F anymore. Now those edges from 
M which are still in B^o^{F) yield a matching (obviously, since only edges have 
been removed) covering all remaining clause-nodes (they were covered before, and 
only useless edges have been removed). I 



1.8.2 Matchings within satisfying assignments 

As we already remarked, the matching number v{B{G)) of the clause- variable graph 
of G, the maximum size of a matching in B{G), is v{G) — c{F) — 5*{F). Obvi- 
ously for partial assignments ip we have v{B^p{F)) < i'{ B{F) ); call ip matching- 
maximum if v{B^p[F)) = v(B{F)) holds. By Lemma 1.7.2| we know that there 



exists a matching-maximum partial assignment for every clause-set. The main re- 
sult of Section |l.8| is the following theorem, which states that every partial as- 
signment can be efficiently repaired by "conservative changes" , so that we obtain 
a matching-maximum partial assignment. Here by a conservative change of a 
partial assignment ip w.r.t. a clause-set F we mean either adding some assignment 
Dy for some v G var(F) \ Ym:{ip), or performing a conservative flip, 
that is, changing the value v e y'at:{lp) ^ p^iv) to some e G Dy \ {<p(u)}, obtaining 
p}\ such that all clauses of F satisfied by ip are also satisfied by p>' (note that this 
property holds automatically for the first type of change, the extension of Lp). So 
if we have a sequence of conservative changes, then the corresponding sequence of 
sub-sets of satisfied clauses is monotonically increasing; especially if we start with 
a satisfying assignment, then all partial assignments in the sequence will also be 
satisfying assignments. 

Theorem 1.8.4 For a generalised multi- clause- set F G A4CCS and a partial as- 
signment ipo, in polynomial time a sequence of conservative changes w.r.t. F , start- 
ing with ipo, can be computed such that the finally obtained partial assignment tp is 
matching-maximum for F. 

Before proving this theorem, we derive three corollaries. 
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Corollary 1.8.5 For a satisfiable generalised multi-clause-set F G A4CCS there 
exists a satisfying assignment which is matching-maximum. 

We obtain the following generalisation of Theorem 7.16 in pTf : 

Corollary 1.8.6 For each satisfiable generalised multi- clause- set F G A4CCS there 
exists a partial assignment ip G TiASS with n{ip) < 5*{F) such that (p*F is matching 
satisfiable. 



Proof By Corollary 1.8.5 there exists a satisfying assignment (pQ for F and F' < F 
with c(F') = c{F) — S*{F), such that ipo is matching-satisfying for F' . Let F" := 
F - F'. We have c(F") = 5*{F) and (^o is satisfying for F", so by Lemma |l.8.2| , 
Part |l| there exists Q po with n[ip) < S*{F) such that </? is satisfying for F". 
Now ipiQ — ifQ o (p is matching-satisfying for F' , and thus by Lemma 1.8.3 (po is 
matching-satisfying for ip ^ F' — ip * F' -\- tp F" — ip * {F' -\- F") = ip * F. I 



Corollary 1.8.7 The satisfiability problem for generalised multi- clause- sets F with 
S*(F) < k for constant /c G Nq is decidable in polynomial time (and if F is satisfi- 
able, then a satisfying assignment can be computed). The algorithm runs through all 
partial assignments ip with vai{ip) C var(i^) and n{ip) < S*{F) and checks whether 
ip* F is matching satisfiable: If yes then for a matching-satisfying assignment for 
ip* F we obtain a satisfying assignment ipo^ for F , while if no matching satisfiable 
sub-instance was found in this way, then F is unsatisfiable. 

In it was shown that in the boolean case the satisfiability problem for bounded 
maximal deficiency actually is fixed-parameter tractable. By reducing the general 
case to the boolean case, we will show fixed-parameter tractability in Theorem 
2.3.5| (Part II) also for generalised cla use-set s. So just for satisfiability decision for 



bounded maxima deficiency, Theorem 1.8.4 is in a sense superseded by a different 



method, but we will later show in Theorem 1.10.3, that actually a stronger result 



can be obtained from Theorem 1.8.4 , namely computation of the lean kernel in 
polynomial time. 



The remainder of this subsection is devoted to the proof of Theorem 1.8.4 (gen- 
eralising, simplifying and also strengthening the results on "admissible matchings" 
in [M ) . In order to bring out the general structure of the p roof we will present 



a more general result (which directly implies Theorem 1.8.4| ), exploring "param- 
eterised maximum matching problems" . We consider the situation where a fixed 
graph G is given together with an arbitrary set V ^ $ oi "parameters" (which in 
our case are partial assignments, while G = B{F)) and a mapping ip £ V G^, 
where G^ is a partial graph of G (that is, V{G^) = V{G) and E{G^) C E{G)). Let 
us call this parameterisation matching- optimal, if there exists some Lp &V such that 
y{Gip) = i^iG). A matching-optimal parameterisation does not establish a method 
to find some ("good") ip e P where i^(G^) — i'{G) is attained. We consider the 
problem that we want to transform some arbitrary starting parameter (/?o into such 
a good ip, and so we assume further that some relation R CV xV is given such that 
a relation ipRip' indicates an admissible move (in our application ipRip' holds if ip' 
results from </? by a conservative change). Denoting by R* the reflexive-transitive 
hull of R (that is, allowing an arbitrary number of admissible moves), we call the 
parameterisation strongly matching-optimal, if for every ipo G V there exists ip £ V 
with ipoR*ip and i'{Gip) = v{G). 

In the sequel we only consider bipartite G with a bipartition {A,B). We call 
the parameterisation V conditionally extensible if for every ip € V, every matching 
M in G^ and every edge e G E{G) \ M such that M' := M U {e} is a matching in 
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G and no matching M* in G<p with Ad* D M covers the endpoint of e in B, there 
exists (p' G V with (pR*(p' such that M' is a matching in G^'. 

Lemma 1.8.8 // a parameterised matching problem (G,V) is conditionally exten- 
sible, then it is strongly matching- optimal. If furthermore in polynomial time in the 
size of G a sequence of admissible moves from ip to ip' in any conditional extension 
can be found, then in polynomial time (in the size of G) for any ipo V a sequence 
of admissible moves to ip Cz V with viB,^) = v{G) can be found. 



Before proving Lemma 1.8.8, we show that in our apphcation, considering B{F) 
with the parameterisation hy ip ^ TIASS i— ?■ Bip{F) together with R as the relation 
of conservative change, the property of conditional extensibility holds true. So we 
consider the situation where we have a maximal matching M in B^p(F), where an 
edge {C, v} in B{F) exists with uncovered cndpoints, and we show that by just one 
conservative change we can extend M by this additional edge. 

Lemma 1.8.9 Consider a generalised multi-clause-set F 6 M.CCS, a partial as- 
signment ip G TMiSiS, a matching M in B^{F) and an edge {(Co, i), (wo, i)} £ 
E(B(F)) such that neither C :— (Cq, i) nor v := {vq, j) is covered by M . We assume 
furthermore that no matching M* D M in B^{F) covers v. Let M' := MU{{C, v}}. 
By definition M' is a matching in B{F), and furthermore there exists a conservative 
change for p w.r.t. F, resulting in p' , such that M' is a matching in B^i{F). 

Proof Let (t;o,eo) £ Co- If vo ^ var((y5), then :— p o {vo — s> e) for any 
e e Dyg \ {eq} yields the required conservative change; so assume Vq G va.T{p). Now 
we have p>{vo) = ^O; since otherwise M' would be a matching in B^p{F) covering 
V. Let E be the set of values e G Dy^ occurring in M, that is, there is some edge 
{(Cq,^'), {vo,j')} e M with {vo,e) £ Cq. Since v is not covered by M and M is a 
matching, M can cover at most {\Dy^ | — 1) — 1 many variable-nodes with underlying 
variable vq, and so we have \E\ < \Dyg \ — 2. Thus there is e' £ D^g \ {eq}. Set 
ip' ■.= po{vQ -> e'). By definition Af is a matching in i?;^/ (i^). Now consider a clause 
D ^ F falsified by p' , and assume that D is not falsified by p. Thus {vq, e') G D, and 
the literal {vq,£') is the only literal in D satisfied by p. So no clause-node covered by 
M has clause D associated with it. It follows that M* M U {{{D, l),w}} would 
be a matching in B^{F) extending M and covering v, contradicting the assumption 
of the assertion. I 



Thus by Lemma 1.8.8 now Theorem 1.8.4 is proven. In the remainder of this 



subsection we prove Lemma 1.8.8. The reader might recall the preliminaries on 
matchings (Subsection 1.2.2), where the notion of an M-augmenting path P for a 
matching M in a graph G is discussed: a larger matching M + is obtained by adding 
the edges from P to M, which are not in M, while removing the other edges of P 
from M. In order to perform the "relinking" , necessary for the transition from M 
to M+, we show an auxiliary lemma. 



Lemma 1.8.10 As in Lemma 1.8.i consider a conditionally extensible parame- 
terised matching problem {G,P), where {A, B) is a bipartition of G. Consider 
furthermore ip €P, a maximal matching M in Gp, an edge e — {a,b} £ AI with 
a € A,b B , and an edge e' = {a, b'} £ E{G) where b' is not covered by M . Then 
by at most one conditional- extension- step we obtain a parameter ip' with pR*p' 
such that M' {M \ {e}) U {e'} is a matching in G^pi . 

Proof If M' is a matching in G^p, then we are done. Otherwise we have e' ^ 
E{Gip); let Mq := AI \ {e}. We want to apply conditional extension to AIq and 
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M' — Mq U {e'} (if we succeed then we are done), so we have to show that there 
is no matching Mq in covering b' with Mq C Mq. Assume the contrary, and 
consider the edge {x, b'} G Mq-. Since G is a graph (no parallel edges) we have x ^ a, 
and thus MU {{x, b'}} is a matching in (using bipartiteness of G) contradicting 
maximality of M. I 



Now we are in a position to prove Lemma 1.8.8; it suffices to show that for 
any given E V and a matching M in Gip with \M\ < i^^G), by a polynomial 
number of extension steps we can find tp' with ipR*ip' and a matching M' in Gipi 
with \M'\ > \M\ (by repeating this process we finally obtain a maximum matching 
for G). If M is not maximal in G,p, then we can add one edge while keeping ip 
and we are done, so assume that M is maximal in G^. Since M is not a maximum 
matching in G, there exists an M-augmenting path P in G. P is of the form 
(vq, . . . ,Vm,) for (pairwise) different vertices Vi and m odd, such that uq, Wm are not 
covered by M and such that for < i < to we have {vi, Ui+i} ^ M for even i, while 
for odd i we have {vi,Vi+i} £ M. W.l.o.g. Vi E B for all even i. The fir st task 



is for odd i < to to replace the edge {vi^vi+i} by {vi,Vi-i\, using Lemma 1.8.1C 



we proceed consecutively for i = 1, 3, 5, ... , where if at some point Lemma 1.8.10 



is not applicable, then we constructed a matching of the same size as M which is 
not maximal w.r.t. its parameter, and so we obtain M' by enlarging this matching. 
Otherwise, if the process goes through, then at the end we obtain a matching M'q 
in G^/j where \Mq\ = \M\ and Vm-i^Vm are not covered by Mq. Again, if Mq is 
not maximal, then we get the required larger M', while otherwise we can apply 
conditional extension, obtaining M' := Mq U {{wm-i, w™}}. QED 



1.9 Matching autarkies 



In this section we introduce the autarky system for generalised clause-sets given by 
"matching autarkies" , and we develop various polynomial time procedures. "Match- 
ing autarkies" for clause-sets with non-boolean variables have been introduced in 
|46|| , and some basic properties have been stated regarding the direct translation of 
clause-sets with non-boolean variables to cla use-sets with boolean variables. How- 
ever, as we will discuss in Subsection 1.9.4, this earlier version of the notion is 



actually too restrictive (another example for the subtleties one encounters with 
non-boolean variables). An outline of the content of this section is as follows. 

Having a (restricted) concept £ of satisfying assignments, we can "typically" 
obtain an autarky system (recall Subsection L4) by calling ip a autarky "Jor a 
clause-set F if is a ^-satisfying assignment for F[Yai(Lp)] (recall Subsection 1.3.5|) , 
or, equivalently at least for general satisfiability, if 1^9 is a ^-satisfying assignment for 
Fvax{ip)- We have to leave such a general theory to future work, but in this article we 



will consider in Subsection 1.9.1 "matching autarkies" obtained in this way from 
matching-satisfying assignments. A fundamental notion is the notion of a "tight 
sub-clause-set" F' of a clause-set F, characterised by the condition 5{F') = 5*{F) 
(that is, F' realises the maximal deficiency of F). Translating general results of 
matching theory into our setting, the set of tight sub-clause-sets of F form a set- 
lattice (i.e, union and intersection of tight sub-clause-sets are again tigh t), and so 
we have a smallest and a largest tight sub-clause-set. In Subsection 



1.9.2 



consider matching leanness (and the matching lean kernel), and in Lemma 1.9.4 we 



characterise matching lean clause-sets F by the condition that all strict sub-clause- 
sets of F have a deficiency strictly less than the deficiency of F, in other words, F 
is the only tight sub-clause-set. Thus matching leanness is decidable in polynomial 
time, and applying the general procedure from Lemma 1.4.2 we obtain polynomial- 
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time computability of the matching lean kernel in Corollary 1.9.6| . P^ Since the 
empty sub-clause-set has deficiency 0, we obtain S{F) > 1 for non-empty matching 
lean clause-sets. Matching lean clause-sets of minimal deficiency (i.e., deficiency 
1) are further considered in Subsection 



1.9.3 



Finally, in Subsection 



1.9.4, we 



refiect on the definition of "deficiency" and "matching autarky" as defined in this 
article, by comparing it with an earlier version of these notions (for generalised 
clause-sets) . 



1.9.1 Matching autarkies for generalised clause-sets 

A partial assignment ip is called a matching autarky for F G AiCCS if ip is 
matching-satisfying for -F'vai (i/3)' which is equivalent to ip being matching-satisfying 
for i<"[var((p)]. Said explicitly, a partial assignment is a matching autarky for a 
(generalised) multi-clause-set F iff for every clause-occurrence C in touched by (p 
we can choose a literal xc £ C such that ipixc) = 1 and such that for every variable 
V £ var(C) there are at most — 1 many (touched) clause-occurrences C with 
var(a;c) = v- The set of all matching autarkies for F is denoted by MAuk(f ). 
Generalising Lemma 7.1 and the remarks in Section 8 of we get 

Lemma 1.9.1 F G AiCCS i— > MAuk(F) C Auk(i^) is a normal autarky system. 

We denote by Nma •= NmauIj the normal form for multi-clause-sets obtained by 
eliminating all matching autarkies. According to our general results and definitions 
on autarky systems, the set of MAuk-satisfiable multi-clause-sets is just AiSAT, 
the set of matching satisfiable multi-clause-sets. The set of MAuk-lean clause-sets 
is denoted by AdCSAAf, its elements are called matching lean multi-clause-sets. 
We now seek to characterise MCSAJV, and to compute Nma{F) in polynomial time. 

A sub-multi-clause-set F' < F oi a, multi-clause-set F G M.CCS is called tight if 
5{F') = 5*{F) holds. If F' is tight for F, then F' is an induced sub-multi-clause-set 
of F. By supermodularity of the deficiency (for graphs) we immediately get 

Lemma 1.9.2 (Binary) Union and intersection of tight sub-multi- clause- sets of a 
given multi-clause-set are again tight. So the tight sub-clause-sets of a clause-set 
form a set-lattice with smallest and largest element. 



Generalising Lemma 7.3 in 47 , we obtain the fundamental relationship be- 
tween tight sub-multi-clause-sets and matching autarkies: application of matching 
autarkies does not reduce the deficiency, and application of suitable matching au- 
tarkies allows to realise every tight sub-multi-clause-set. 

Lemma 1.9.3 Consider a generalised multi-clause-set F G AiCCS. 

1. For every autarky p for F we have d{ip * F) — S{F) — d{F[vSLT{ip)]) . 

2. For every matching autarky p for F we have 5{ip> * F) > 5{F), and thus 
5*{p*F) = S*{F). 

3. If F' <F is induced, then we have S*{vaT{F') * {F - F')) < S*{F) - 6{F'). 
4- If F' < F is tight, then there is a matching autarky ip for F with p * F = F' . 

for a 



^^^Direct computations using matching arguments are more efficient; see Subsection 
discussion. 



1.12.1 
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Proof For Part ^ note that by definition we have 

c{F) ^ c{ip *F) + c{F[va.r{(p)]), n{F) = n{ifi * F) + n{F[va.r{ip)]) 

due to = (fi * F + -Fvar((p)- Part ^ fohows from Part |l|. For Part |3| consider 
G < var(F') * (F - F'). There exists Gq < F ~ F' with var(F') *Go = G (consider 
the original clauses); so we have c{Go) = c{G) and var(G'o) \var(F') = var(G). Now 

6*{F) > 6{F' + Go) = c{F' + Go) - wn(^^' + Go) = 

c{F') + c(G) - wn(i^') - wn(G) = 5{F') + 6{G), 

and thus (5(G) < 5*{F) - 5{F'). Now Part | follows immediately from Part || due 
to (5*(var(F') * [F - F')) < S*{F) - 6{F') = 0, i.e., F - F' is a matching autark 
sub-multi-clause-set of F. I 



1.9.2 Matching lean generalised clause-sets 

Generalising Theorem 7.5 in [ p7[ , we now can characterise matching lean multi- 
clause-sets: 

Lemma 1.9.4 Consider a generalised multi- clause- set F € A4CCS. The following 
conditions are eguivalent: 

1. F is matching lean; 

2. VGeF:,5*(F-{G}) <5*{F); 

3. VF' <F: S{F') < d{F); 

4-. F is a tight suh-multi- clause- set of F, and there are no other tight sub-multi- 
clause-sets of F. 



Proof From Part follows Part ^ by Lemma 1.9.3 , Part |[ Obviously, Part || 
implies Part |3| , and Part || implies Part ||. Finally, Part |l| follows from Part |^ by 
Lemma |L94 Part 3. I 



Corollary 1.9.5 It is decidahle in polynomial time, whether a generalised multi- 
clause-set F G AiCCS is matching lean. 



Proof Use Lemma [1.7.4 (poly-time computability of the maximal deficiency) and 
Lemma |1.9.4|, Part 0. I 



An alternative method for deciding matching leanness will be given later (by 
1.11.1 together with Lemma |1.11.3| ) . Applying the general procedure from 



Lemma 
Lemma 
the (matching) lean kernel: 



1.4.2| , we can enhance (matching) leanness decision to the computation of 



Corollary 1.9.6 The matching lean kernel Nma(^) for generalised multi-clause- 
sets F G MCCS is computable in polynomial time. 

Since matching autarkies constitute a normal autarky system, we know that 
there are matching autarkies (/? for F with ip * F = Nnia(F); in Lemma 1.10.1 we 



show how to find such "quasi-maximal" matching autarkies in polynomial time, 
using the computation of the matching lean kernel as the main step. See also the 



open problems in Subsection 1.12.1 regarding more efficient computations. 



Back to the characte risati on of the matching lean kernel, by Lemma 1.9.4, Part 
I together with Lemma 1.9.3 , Part || we get 
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Corollary 1.9.7 For every generalised multi- clause- set F G A4CCS the match- 
ing lean kernel Nma(^) is the intersection of all tight sub-multi- clause- sets of F. 
Thus N,„a(-F) is the smallest tight sub-multi-clause-set of F, and therefore 6*{F) = 

5(N^a(F)). 



1.9.3 Matching lean clause-sets of minimal deficiency 



Using S{T) — 0, from Lemma 1.9.4, Part y we get the following generalisation of 
"Tarsi's Lemma" (see §): 

Corollary 1.9.8 // the generalised multi-clause-set F ^ T is matching lean, then 
5*(F) = (5(F) > 1. 

Obviously MU C LEAN , and thus: 

Corollary 1.9.9 // a generalised clause-set F G CCS is minimally unsatisfiable, 
then we have S*{F) = S{F) > 1. 

In Theorem 4.5, arbitrary constraints over boolean variables are considered, 
and a lower bound on the number of clauses in terms of the number of variables for 
minimally unsatisfiable constraint satisfaction problems is derived, which necessarily 
is much weaker than Corollary 1.9.9. 

After these general structural results on matching autarkies and matching lean 
clause-sets, we conclude this section by some example classes. Considering mini- 
mally unsatisfiable clause-sets of minimal deficiency, we observe that removing any 
clause from a matching lean multi-clause-set F with 5{F) = 1 yields a matching 
satisfiable multi-clause-set, and thus 



Corollary 1.9.10 MUs=i = MC£A\fs=inUSAr. 

The class A4l4s=i of minimally unsatisfiable clause-sets of minimal deficiency is 
characterised in Theorem 2.5.5 (Part II), and so we have a good understanding of 
the unsatisfiable elements of MC£AN's=i- The satisfiable elements of MjC£AAfs=i 
on the other hand seem to have a more complicated nature, where the interest in 
this class may be justified by the following property. 

Corollary 1.9.11 The class MC£AJ\fs=i of matching lean generalised clause-sets 
of deficiency 1 is exactly the class of all minimally matching unsatisfiable clause-sets 
( clause-sets which are not matching satisfiable, while every strict subset is matching 
satisfiable) . 

An interesting example for a satisfiable boolean F G A4.C£AN's=i with n{F) — 8, 
exhibited in Section 5 of is given by the following clause- variable matrix (the 
rows correspond to the clauses, the columns to the variables, where an entry "±" 
denotes a positive/negative occurrence, while denotes non-occurrence): 
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Obviously S{F) = 1. Since every two different rows clash in exactly one element, 
_F is a 1-regular hitting clause-set. Every column contains at least two "— " and 
two "+" , and thus every variable occurs at least two times negatively as well as 
positively (the purpose of this example was to refute a conjecture by Endre Boros 
that every boolean 1-regular hitting clause-set of deficiency 1 must contain a variable 
occurring in one sign only once). To demonstrate that F is matching lean, consider 
the following subgraph of B{F): 




Here variable-nodes corresponding to row j are denoted by Wj, and clause-nodes 
corresponding to row i by c'. This subgraph has the special property that it is a 
spanning tree, where the variable-nodes all have degree 2. From these properties 
by Corollary 1.11.2| in the next subsection it follows that F is matching lean.p^ 

Finally we mention that a natural class of (generalised) multi-clause-sets, where 
every (general) autarky is a matching autarky, is the class of multi-clause-sets F 
such that for all variables v G var(F) and all values e e D„ we have 4i^[v,e){F) ^ 1; 
we call such (generalised) clause-sets totally singular. Thus totally singular F are 
satisfiable iff they are matching satisfiable; an alternative method for quick SAT 
decision here is given by the observation that the class of totally singular multi- 
clause-sets is stable under DP-reduction (recall Subsection 1.6), and furthermore 
every variable is either pure or a singular variable, and thus elimination of pure 
variables together with repeated application of singular DP-reduction also solves this 
formula class. We remark that the minimally unsatisfiable F in this class are exactly 
the marginal minimally unsatisfiable clause-sets of deficiency 1, as characterised in 
Corollary 1^ (Part II). 



1.9.4 Comparison with an earlier version of "matching au- 
tarkies" 

In [^6| an earlier version of matching autarkies has been introduced, which we will 
call here "non-repetitive matching autarkies" : A partial assignment is called non- 
repetitive matching- satisfying for a multi-clause-set F G M.CCS, if for every clause- 
occurrence C in _F (taking multiple occurrences into account) a literal xc G C 
can be chosen with (p{xc) = 1 such that for different clause-occurrences C, C" we 
have Xc 7^ xc'- And Lp is called a non-repetitive matching autarky for _F if 1^9 is 
non-repetitive matching-satisfying for -FVar((p)- 

Recalling the three conditions (|) - (|i]) from Subsection 1.7.1 and strengthening 
condition (m) to 



■^'''This can also be seen directly as follows. For a graph G let &{G) := \E{G)\ — \V{G)\. Using 
k(G) for the number of connected components of G, we have that G is a forest iff 5(G) = — k(G) 
(while for every graph G we have (5(G) > — k(G)). Now consider any non-empty set V' of variable- 
nodes in the above forest T (that actually we have a tree is irrelevant) together with the induced 
sub-graph T' given by V U Tjr(V'). Since also T' is a forest we have < < — 1, 

where \E{T')\ = 2\V'\ and \V{T')\ = \V'\ + |r^/(y')l: and thus \Vjr,{V')\ > \V'\ + 1, where 
|rjr/(y')l < + so that S{F[V']) > 1. Since this holds for every non-empty V, it 

follows that F can not have a non-trivial matching autarky. 
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(|)' for i e {1, . . . ,m\ there are variables vi S var(^i) such that for aU clause- 
occurrence C in Fi there are literals xc G C with var(a;c) = Vi, and such that 
for different clause-occurrences C, C" we have xc 7^ xc ; 

we get that F is non-repetitive matching satisfiable iff F has a decomposition ful- 
filling (|)', (H) and d]). By (§)' we get c(^;) = |val„,(Fi)l: and thus from follows 
(|iii|)'. Whence a non- repetitive matching-satisfying assignment 1^9 for F is matching- 
satisfying for F , and a non-repetitive matching autarky for F is also a matching 
autarky for F . 

For boolean clause-sets, non-repetitive matching autarkies are identical with 
matching autarkies, but in general non-repetitive matching autarkies are more re- 
strictive than matching autarkies. Consider a variable v with = {0,1,2}, and 
consider the multi-clause-set Fi := 2 • {u ^ 0}. Fi is matching satisfiable, while 
being lean w.r.t. non-repetitive matching autarkies. Neither pureness of v nor the 
existence of unit-clauses is essential here, as the following extended example shows 
(using additional boolean variables a, h, c, d, while Dy ~ {0, 1, 2, 3}): 

F[ {{t;^0,a,6},{«^0,a,fe},{a,6},{a,fe}, 

{v^ l,c,d},{i;7^2,c,d},{u^3,c,d}}. 

F[ is matching satisfiable via (w — J> 1, a — ;> 0, 6 0, c — > 1, d ^> 0), while F[ is again 
lean w.r.t. non-repetitive matching autarkies. 



Further examples are discussed in Subsection 2.3.3 (Part II). There we actually 
show that non-repetitive matching autarkies are preserved by the direct translation 
of (generalised) clause-sets into boolean clause-sets, which in general is not the case 
for matching autarkies, and so perhaps non-repetitive matching autarkies never- 
theless might have some advantages over matching autarkies? The main problem 
with the notion of non-repetitive matching autarkies is that it does not seem to 
support a natural notion of related deficiency (with the same nice properties as 
the combination of matching autarkies and (standard) deficiency), and, related to 
this problem, it does not seem obvious how to achieve polynomial time decision of 
the class of non-repetitive matching lean (multi-)clause-sets. The whole problem 
boils down to the point that non-repetitive matching autarkies do not seem to be 
given solely by a matching condition, but require some other form of a more global 
condition. Thus, to conclude, the generalisation of (boolean) matching autarkies 
together with the generalisation of (boolean) deficiency introduced in this section 
seems to be the right choice, as demonstrated by the theory built up in this section, 
and as further validated by the applications in the following sections. 



1.10 Finding autarkies in poly-time for bounded 
m2iximal deficiency 

We already know that the satisfiability problem is decidable in polynomial time 



for bounded maximal deficiency (Corollary 1.8.7). In this section we strengthen 
this by computing a maximal autarky in polynomial time (for bounded maximal 
deficiency). Several first attempts fail to solve the problem: 



In Sections 11.10.1 - 11.10.3 of [k5[ general methods for computing the lean 



kernel are discussed. Especially the general method from Corollary 1.5.2 (in 
the present article) could yield a poly-time algorithm, since for bounded max- 
imal deficiency we can find either a satisfying assignment or a resolution refu- 
tation tre e in p olynomial time (the latter follows by the methods used to prove 
Theorem ^.3.5|) (Part II). 
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• But the problem is that crossing out variables can increase the maximal de- 
ficiency, and so actually these methods seem to fail for bounded maximal 
deficiency. 

• Based on matroid theory, in it has been shown how to compute the 
lean kernel in polynomial time for boolean clause-sets, which via the direct 



translation to boolean clause-sets (see Subsection 2.3.2 in Part II) can be 



generalised to non-boolean clause-sets as considered here. 

However we do not obtain an autarky, and self-reducibility ([^) seems not 
to be applicable — again the problem is that the maximal deficiency might 
be increased. 



Fortunately we are able to strengthen the method from Corollary 1.8.7, which will 
be done in the following. First we have to strengthen Corollary 1.9.6, by efficiently 
computing for multi-clause-sets F a partial assignment with ip * F = N^n{F). 
Again, for finding such a "quasi-maximal" matching autarky ip for F the general 
procedure given in (just based on deciding matching leanness, via an oracle) can 
not be used, since the matching condition does not allow addition of unit-clauses. 
The following method was already mentioned in Lemma 1.4.2, but for completeness 
we state this special case explicitly. 



Lemma 1.10.1 For a generalised multi-clause-sets F G MCCS we can compute a 
matching autarky ip for F with ip * F ~ ^raa.{F) in polynomial time as follows: 



First compute Nijia(^') by Corollary 1.9.(\ . 



Thus by Lemma j| there is a matching autarky ip for F with ip * F ~ 
N,„a(-F), namely ip is any matching- satisfying assignment for F\V] of V :— 
var(f ) \ var(Ni„a(-F')) (where vaT{ip) ^V), and those can be found by Lemma 



L8.1 



Second, we need to strengthen the general decomposition result Lemma 1.4.1 by 



considering the (maximal) deficiency of the satisfiable part of the decomposition: 



Lemma 1.10.2 Consider a normal autarky system A, and the decomposition F = 
Fi -\- F2 of a generalised multi-clause-set F G JVlCCS according to Lemma \l.4.ii into 
the A-lean kernel Fi — N^(i^) and the largest A-autark sub-clause-set F2 — Fy for 
V :~ var(_F') \ var(i<i). Assume furthermore that Fi is matching lean. Then we 
have S*{F[V]) < 6*{F), and if Fi ^ T, then S*{F[V]) < S*{F) holds. 



Proof Consider F' := Fi, and assume that Fi ^ T. Lean kernels w.r .t. au tarky 
systems are always induced sub- multi-clause-sets, and thus by Lemma 1.9.3, Part 
I we have 5*(var(F') * (F - F')) < S*{F) - S{F'), wh ere va r(f Q 
var(i^i) *F2= F[V], and d{F') = d{Fi) > 1 by Corollary [TgJ. I 

Now, strengthening Corollary 



(F - F') 



1.8.7] , we are not just able to find a satisfying 



assignment in polynomial time (if existent) for bounded maximal deficiency, but we 
can also find a non-trivial autarky (if existent): 



Theorem 1.10.3 Consider a constant fc G Nq. Then for a generalised multi- 
clause-set F G AiCCS with S*{F) < k in polynomial time we can find a non-trivial 
autarky if existent (and thus, by repetition, we can compute Ni„a(^') in polynomial 
time). The procedure for finding a non-trivial autarky is as follows: 
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As in Corollary 1.8.11 ™^ ^'''^ through all non-trivial partial assignments (p 



with vav{ip) C var(_F') and n(ip) < 6*{F). 



Instead of just checking whether ip*F is matching satisfiable, by Lemma 1.10.^ 



we compute a partial assignment ip with ip * (tp * F) = NmaCv' * F) (and 
var(^) C va,i{(p * F) ). 

Then we check whether tp o ip is a non-trivial autarky for F — if this is the 
case, then we found a non-trivial autarky for F , while if this is never the case, 
then F is lean. 



Proof Let V := var(F) \ var(Na(-F)). So F[V] is satisfiable, and by Corollary |l. 8.6 
there exists a partial assignment tp with n((p) < S*{F[V]) and vaT{ip) C V such that 
ip* F[V] is matching satisfiable; note that we read ip* F[V] as 1^9 * and that 



ip * {F[V]) = * F)[V] holds. By Lemma |1.10.2| we know that d*{F[V]) < S*{F), 
and thus ip will be considered by the procedure, and the assignment "0 computed 
satisfies {<p * F)[V]. So we have a satisfying assignment ipop for -^[1^], i.e., we have 
an autarky o ip ioi F with ip * {(p * F) = Na(F). I 

As the proof of Theorem L10.3| shows, by running through all ip we will actually 



find a quasi-maximal autarky 6 (with 6*F = Na(i^)), however it is somewhat easier 
to stop when the first non-trivial autarky has been found. 



1.11 Expansion and the surplus 

In this section the notions of "expansion" and "surplus" are studied. The basic 
question is: if we consider arbitrary non-empty sets of variables and all clauses 
containing at least one of them, how many more clauses than variables do we have 
at least? This minimum (over all "expansions") is the surplus. Having at least a 



surplus of 1 turns out to be equivalent to being matching lean (Lemma 1.11.1). 

One important application of the surplus is in establishing that by fixing one 
variable to a value, the maximal deficiency must get smaller, and this matter is dis- 



cussed in Subsection 1.11.3 (generalising the results from |8^). We obtain a simpli- 
fied proof of fixed-parameter tractability of satisfiability in the maximal deficiency 
S*{F) for boolean clause-sets, using the method of "bounded search trees". The 
main tool for this application is a poly-time reduction S : MCCS — > M.CCS (see 



Lemma 1.11.9) with the properties, that the maximal deficiency is not increased 
while a surplus of at least 2 is established. In general we have for all variables 
V G var(i^) and all e G Dy the upper bound 

5*{{v e) * S{F)) < S*{S{F)) - 1 



(see Corollary 1.11.5), given that the surplus of F is at least as big as the domain 
size of v (and some further reduction condition holds, namely F does not contain 
singular variables). Thus by a trivial DLL branching algorithm, using additionally 
only the reduction F 1-^ S{F) at each node, we obtain SAT decision for F in time 
2* (f") . poly(£(F)) for boolean clause-sets. Later, in Theorem 2.3.5| (Part II), we 



show how to achieve the same bound also for generalised clause-sets, by exploiting 
the direct translation to boolean clause-sets. 



1.11.1 Basic properties 

For multi-clause-sets F we have defined the bipartite graph B{F) together with 
its canonical bipartition {F_,V_). The general definition of deficiency (for arbitrary 
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graphs) then yields the deficiency Sb{f){F') — ~ \^ b(f){F')\ for sets F' C F_ 
of clause-nodes, as well as the deficiency 6b(f){V) = \V\ — \^ b{f){V)\ for sets 



y C y of variable-nodes. ^ So we have a "clause-based deficiency" as well as a 
"variable-based deficiency" . Identifying F' with a sub-multi-clause-set of F, we have 
= c(F') and \Tb(f){F')\ = wn{F'), and thus 6B{F)iF') (the graph-theoretical 
deficiency of sets of clause-nodes) is the same as the deficiency of multi-clause-sets as 



we have defined it in Subsection 1.7.1 . At first sight, the situation for V' seems not 
to be naturally interpretable on the level of multi-clause-sets, since V may contain 
for some variable v only some of the \Dy \ — 1 copies of v. To consider this problem, 
let V" the the set of variable-nodes obtained from V' by adding for {v,i) E V all 
(w, j) for j e {1, . . . , \Dy \ — 1}. Now we have 

SBiF){V") ^ -S{F[V^]), 

where Vq is the set of variables in V (or V"). Since SB{F){y) < ^b(f){V") due 
to rB(F){y) = ^B{F)iV'), we see that actually, since we are only interested in 
maximising the deficiency, also the variable-based deficiency has a sensible inter- 
pretation at the (conceptual) level of multi-clause-sets. Now two changes are applied 
to the variable-based deficiency, resulting in the notion of "expansion", related to 
the deficiency of a set of variables (analogous to the deficiency of a (sub-)multi- 
clause-set), and in the notion of "surplus" related to the maximal deficiency over 
all sets of variables (analogous to the maximal deficiency of a multi-clause-set). 
The first change is just to switch signs, so that we can use (5(i^[Vi3']) instead of 
—S{F[Vq]). More substantially, we exclude the empty set of variables for the sur- 
plus: The maximal deficiency d*{F) of a multi-clause-set is only used to determine 
the size of a maximal matching in B{F), and so negative deficiencies are not of 
interest (they indicate that a bigger matching number is possible — if only there 
would be more clauses) , whence the empty clause-set is taken into account in 5* (F) 
for convenience, to force the maximal deficiency to be at least 0. But now for the 
notion of surplus actually we are only interested in the negative values, that is, in 
the "surplus" which can not be realised, and thus the empty set of variables has to 
be excluded. After these motivations, let us now start with the formal definitions. 

For a multi-clause-set F and a set V of variables let the expansion be defined 
as S{F[V]). As explained above, using the deficiency dB(F) in the (bipartite) graph 
B{F), the expansion equals —Sb{f){V), where V is the set of variable nodes of 
B{F) associated with some variable in V. The surplus of F is defined as 



rriF) 



j min0^ycvar(i^) ^^1^]) if var(i^) ^ 
[O if var(i^) = 



The surplus of F equals the surplus of B{F) as defined in Subsection 1.3 of |Q 
(but with the sides of the bipartition switched(!)). By definition we have cr(F) < 
(5(F[var(F)]) = S{F) - F{±) < S{F). Generalising Lemma 7.7 in Q we have: 

Lemma 1.11.1 A generalised multi-clause-set F G AiCCS with n(F) > is 
matching lean if and only if cr(F) > 1. More specifically, for any generalised multi- 
clause-set F we have: 

For $ ^ V Q var(i^) in case of S{F[V]) < the generalised multi-clause-set 
F[V] has a non-trivial matching autarky; such a non-trivial matching autarky yields 
a non-trivial matching autarky for F , and every non-trivial matching autarky of F 
can be obtained in this way. 



^'^^ Recall that Vq{A) for a graph G and a vertex-set A is the set of all neighbours of A in G. 
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Proof If F is matching lean, then by Lemma 1.9.1| for V C var(F) also F\V] is 



matching lean, and thus by Corollary 1.9.S we get a{F) > 1. If on the other hand F 



is not matching lean, then there exists / V C var(i^) such that F[V] is matching 
satisfiable (where V is the variable set of any non-trivial matching autarky), i.e., 
6*{F[V]) = 0, and thus cr(F) < 0. I 

By Theorem 1.3.8 in [|7^ we obtain the following characterisation of matching 
lean clause-sets: 

Corollary 1.11.2 A generalised multi- clause- set F is matching lean if and only if 
there exists a subgraph T of B{F) with the following properties: 

(i) F is a forest; 
(ii) F covers all variable-nodes; 
(Hi) every variable-node has degree 2 in F. 



An example for the application of Corollary 1.11.2| has been given at the end of 



Subsection 1.9.1. The problem of computing a{F) can be solved by Theorem 1.3.6 



in ||70| as follows (compare Lemma 15 in ||82| for the case of boolean clause-sets): 
Lemma 1.11.3 Consider a generalised multi- clause- set F. 

1. Let M be a maximum matching for B{F). If M does not cover all variable- 
nodes, then a{F) = |Af |— wn(_F) < (that is, <j{F) is the number of uncovered 
variable-nodes, multiplied by —I). Otherwise we have (y{F) > 0. 

2. Assume cr{F) > 0, and consider s € Ng, s < S{F). For v £ var(i<") with 
\Dv\ > 2 (trivial variables are ignored) let Fg^^ be the multi-clause-set obtained 
from F by adding s new elements to the domain of v ( that is, a new variable 
v' with Dy C Dy' and \Dyi\ — \Dy \ + s is chosen, and Fs^v 'is obtained by 
replacing v by v' in F). Let Ma^v be a maximum matching for B{Fs^v)- Then 
we have: 

(a) If Ms^v does not cover all variable-nodes in B(Fs,v), then cr[F) < s, and 
moreover, from Ms.v in linear time in £{F) a set 7^ C var(i^) with 
6{F[V]) < s can be computed . 

(b) If for all V the maximum matching Ms^v covers all variable-nodes in 
B{Fa^y), then a{F) > s. 



Lemma 1.11.1 together with Lemma 1.11.3 yields an alternative to the decision 



procedure for matching leanness as given in Corollary 1.9.5 



1.11.2 Decreasing the maximal deficiency 

The next lemma tackles the problem of giving a sufficient criterion for S*({v — > 
£)*F) < S*{F); Part H generalises Lemma 7.10 in (the proof there is technically 
not fully correct). 

Lemma 1.11.4 Consider a generalised multi- clause- set F G A4CCS. 

1. For F' < F we have 

5{F') = (5(F)-(5(F[var(i^)\var(F')])-r;(F,F') < 5{F)-5{F[yw{F)\yw{F')]), 

where rj{F, F') :— c{F) — c(Fvar(F)\var(_F')) ~ c{F') > is the number of clause 
occurrences in F of clauses C with var(C) C var(i^') but not occurring in F' . 
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2. For F' <F we have 5{F') < 5{F) - mm{c{F) - c{F'), (j{F)). 

3. Consider v £ vslt(F). Then for e G D^^ we have 

S*{{v ^e)*F)< SiF) ~ min{s^,^,-,{F), a{F)) + \D,\ - 1. 
Thus, using my{F) := min^g/j^ •S(D.e)(-F'); we obtain 

S*{{v -^e)*F) < S{F) ~ mm{my{F), a{F)) + |L>„| - 1. 

Proof For Parts and | let V := var(F) \ var(F'). 

The equation in Part ^ follows immediately with S{F) ~ c{F) — wn(F) and 
5{F[V]) = c{F[V]) - wtl(F[V]), where due to var(F[F]) = ^ we have wtl{F[V]) = 
wn(F) — wn(i^'). And that ri{F,F') > holds follows with the explanation given. 

For Part |^ we consider two cases. If n{F') — n{F), then S{F') = S{F) — 
(ciF) - c{F')). So assume n{F') < n{F) (and thus V ^ 0). By Part we have 
b[F') < 5{F) - S{F[V]), and thus by S{F[V]) > a{F) we get 5{F') < S{F) - a{F). 

For Part || consider an induced F' < (u — > e) * F, and let F" < F he the unique 
sub-multi-clause-set of F with c{F") = c{F') and (w e) * F" — F' . We have 
wn(F") < wn(i^') -|- - 1, and thus 5{F") > S{F') - \Dy \ + 1. 

By part | we get < S{F) - min{c{F) - c{F"),a{F)), where c(F") < 

c{F) — and the assertion follows. I 

Considering a matching lean boolean clause-set F, Part || of Lemma 1.11.4 yields 



the up per bound S*{{v — > e) * _F) < 6{F) for non-pure variables v (using Lemma 
1 . 1 1 . l| ) ; for algorithmic purposes we are interested in cases where the maximal 



deficiency actually decreases: 

Corollary 1.11.5 Consider a generalised clause-set F and a variable v G var(i^) 
with niy{F) > \Dy\ (using the definition ofm^^F) from Lemma 1.11.^ , Part^ and 



cr{F) >\Dy\. Then for each e e Dy we have S*{{v e) * F) < S{F) - 1. 



It is unclear, whether Corollary 1.11.5 is best possible — the condition a{F) > \Dy\ 



is hard to establish for larger domain sizes. The key seems to be to improve the 



estimation used in the proof of Lemma 1.11.4, Part 



I. 11.3 Reductions 

In thi s final subsection we study how to establish the prerequisites of Corollary 

II. II.4 By singular DP-reduction we can eliminate cases with my < \Dy \ as follows. 



Lemma 1.11.6 Consider a generalised multi-clause-set F 6 M.CCS and a variable 
V £ var(F). 

1. Assume v is not pure in F. Then v is a singular variable for F if and only if 



my{F) < \Dy\ (where my{F) is as defined in Part^ of Lemma LII.4) 
2. Assume v is a singular variable. 

(a) 5{DPy{F)) < d{F). 

(b) If V is non- degenerated, then we have 

I. 5{T)Vy{F))^5{F). 
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ii. F is matching lean if and only if DP„ (F) is matching lean. 
Hi. If F is matching lean, then we have S*{DPy{F)) = d*{F). 

Proof The only non-obvious assertion is ^(b)ii| (using the remarks made before 



Lemma 1.6.1, and Lemma 1.9.4). Let Dy = {ei, . . . ,ek} {k = \Dv\), and assume 
w.l.o.g. that = 1 for i < fc; consider Ci, . . . , Ck-i £ F with (w, Si) £ Ci for 

i < k, and let Di, . . . ,D„i G F be the clauses containing (u,£fc) {m = 
Thus = {Ci, . . . , + i{A}. Now with F' ■.= F- we have 

DP^{F) ^ F' + R, 

where R := YnLi { -R* } and R, := Ci U • • • U Ck-i U A for « e {1, ■ . . , m}. First 
assume that F is matching lean, but that we have a non-trivial matching autarky 
ip for DP^(F) with var(<^) C var(DP„(F)) = var(i^) \ {v}. Let V := var((^). If 
V n var(i?) = 0, then (p would also be a matching autarky for F, since var(i?) — 
var(i^{i,}) \ {v}. So assume V fl var(i?) ^ 0. If there exists i G {1, . . . , m} with 
var(Z?i) n = 0, then for all j < k we have var((7j) fl F = 0, and it follows that (p 
would also be a matching autarky for F. So assume that for all « G {1, . . . , m} we 
have var(Z)i) n F ^ 0. Now (u et) o is a matching autarky for i^, contradicting 
matching leanness of F. 

For the reverse direction assume that DPt, (F) is matching lean, but that we have 
a non-trivial matching autarky ip for F. Now it is not hard to see that ip is also a 
matching autarky for DPv{F). I 



Corollary 1.11.7 There is a polynomial time computable map r : MCCS 
A4CCS, such that for a generalised multi- clause- set F we have: 



(i) n{r{Fj) < n[F), c[r{F)) < c{F) andS*{r{F)) < 5*{F). 
(ii) r{F) is satisfiability-equivalent to F. 
(Hi) r{F) is matching lean. 

(iv) r{F) is lean w.r.t. pure autarkies (i.e., r(F) does not contain pure variables), 
(v) r(F) does not contain singular variables. 



Computation of r{F) is as follows: 



1. Apply singular DP- degeneration reduction and reduction by pure autarkies and 
matching autarkies as long as possible. 

2. If there exists a singular variable, then it must be non-degenerated, thus a p- 
plying DP-redu ction does not increase the maximal defect by Part 2(b)ii\ of 
Lemma 1.11.6 , so apply this reduction and go to Step Otherwise output 
r{F) and stop. 



The next lemma contains the main idea for establishing a surplus of two. 

Lemma 1.11.8 Consider a multi- clause- set F G AiCSAAf , such that for all vari- 
ables V G var(i^) we have =ffy{F) > \Dy \ -\- 1, and assume that V C var(_F) is given 
with 6(F[V]) = 1. Then F[V] is satisfiable, and a satisfying assignment ip with 
var((^) C var(i^[y]) can be found in polynomial time. (With p thus we have found 
a non-trivial autarky for F.) 
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Proof Assume that F[V] is unsatisfiable. By Corollary 1.9.10 thus we have F[V] G 
A4Us=i which would contain a variable occurring in all signs exactly once (see the 
later Lemma ^.5.4 in Part II, whose proof does not make use of the results of this 
subsection), contradicting the assumption. So F[V] is satisfiable, and a satisfying 
assignment (p can be found by Corollary 1.8.7. I 



Strengthening Corollary 1.11.7 we get now 



Lemma 1.11.9 There is a polynomial time computable map S : MCCS — > MCCS 

such that for a generalised multi- clause- set F we have: 

(i) n{S{F)) < n{F), c{S{F)) < c{F) and6*{S{F)) < S*{F). 
(a) S{F) is satisfiability-equivalent to F. 

(Hi) S{F) is matching lean and lean w.r.t. pure autarkies, and does not contain 
singular variables. 

(iv) Ifvai{S{F)) ^ 0, then <t{S{F)) > 2. 
Computation of S{F) is as follows: 



1. First reduce F :~ r{F) (see Corollary 1.11.7). 

2. //var(F) = 9 or a{F) > 2 then stop. 

3. Otherwise find some % ^ V ^ var(i^) with 5{F\y]) ~ cr{F) = 1 by Lemma 
l.ll.S, Part^; by Lemma 1.11.^ we can now find a non-trivial autarky ip for 



F: reduce F :— ip * F , and go to Step 

If we only allow boolean clause-sets, then, as explained at the beginning of this 
subsection, we obtain fixed-parameter tractability of satisfiability decision w.r.t. 
the parameter S*{F) by Lemma 1.11.9| and Corollary 1.11.5 (together with Lemma 
1.11.6, Part|^). Finally we mention that a good possibility for further improvements 
is to generalise and strengthen the approach from Q based on matroid theory. 



1.12 Conclusion and open problems 

The first purpose of this article was to set the stage for the study of generalised 
clause-sets as sets of "no-goods" , where literals are given by one "forbidden value" : 
We defined and summarised the basic properties of syntax, semantics, resolution 
calculus and autarky systems. Then we considered the generalisation of the notion 
of deficiency for these generalised clause-sets, and we studied the basic autarky 
system related to this notion, namely matching autarkies. 



1.12.1 Matching autarkies 



Basic tasks are the computation of the matching lean kernel and the computation 
of a quasi-maximal matching autarky (realising the matching lean kernel). In this 
article we did not explore further algorithmic details, and so the computation of the 
matching lean kernel in Corollary 1.9.6 just uses the general procedure of Lemma 
1.4.2| , based on the simple decision procedure for matching lean clause-sets from 



1.9.5| (alternatively. Lemma 1.11.1 together with Lemma 1.11.3 can be 



Corollary 

used); the computation of a quasi-maximal matching autarky in Lemma 1.10.1 
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turn is based on Corollary 1.9.6 in the obvious way. It should be possible to use 



the characterisation of the matching lean kernel in Corollary 1.9.7 (as the smallest 
tight sub-clause-set) for an alternative procedure for computing Nma{F) directly, 
exploiting matching theory. 

Many questions are still open regarding polynomial-time satisfiability decision 
in the maximal deficiency (for generalised clause-sets): 



1. In Corollary 1 



7 polynomial-time satisfiability decision is achieved by using 
the existence of matching-maximum satisfying assignments. Is it possible to 
show also fixed-parameter tractability in this way, using perhaps some form 
of local search which considers only matching-maximum assignments in some 
form? 

Is it possible to directly generalise the approach which yields FPT for the 
boolean case, as given at the end of Section 1.11? 



3. In Section 1.10 it was shown how to find a non-trivial autarky in polynomial 
time in the maximal deficiency, while the approach from based on matroid 
theory yields computation of the lean kernel in polynomial time in the maximal 
deficiency. The question here is, whether for these harder tasks we also have 
fixed-parameter tractability, or whether the problem of deciding leanness is 
fixed-parameter intractable in the maximum deficiency. 

4. Implementations of the various poly-time algorithms here (especially the al- 
gorithm exploited in the proof of Theorem 2.3.5 in Part II) need to be carried 
out in order to study whether interesting applications exist (and also to judge 
whether "native algorithms" for generalised clause-sets are preferable here, or 
whether the boolean translation is superior). 

5. As an alternative to the direct translation, can we also use the (new) nested 
translation from Subsection 2.6.1 (Part II) here? 



Regarding Lemma 



1.11.^ 



one should study the underlying autarky system, and 
whether there is an underlying hierarchy of autarky systems (and accompanying 
reductions) . 

W.r.t. practical applications, one can add the following: 

1. One needs to implement (and to study) reduction by matching autarkies, as a 
preprocessing step and also at each node of the search tree of a DPLL solver. 



2. The extension by Lemma 1.11.8 might have potential 



In general it seems that applications of "expensive" algorithms might be more fruit- 
ful for harder problems like QBF (where autarkies for quantified boolean formulas 
consider only the existential variables, and in the most general setting substitute 
them with boolean functions in the preceeding universal variables such that every 
clause becomes a tautology). 



1.12.2 Satisfiable clause-sets which are minimally matching- 
unsatisfiable 

The class of minimally matching-unsatisfiable clause-sets is exactly AiC£AAfs=i, 
the set of matching lean clause-sets of deficiency one. The unsatisfiable elements 
we know quite well by USAT Ci MC£AAfs=i — M.Us=i, which is characterised 
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in Subsection 2.5. 2| (Part II). However the satisfiable elements given by SAT H 
M.C£AMs=i seem to exhibit a much richer structure: See for example the special 
clause-set at the end of Subsection 1.9.l| , which in fact is a 1-regular hitting clause- 
set (see Section |2.6| in Part II for further studies) — even for this special sub-class 
of satisfiable matching lean 1-regular hitting clause-sets we know not much. 
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Chapter 2 



Minimal unsatisfiability and 
conflict structure 



2.1 Introduction 

We conclude here the study of the basic properties of "generahsed clause-sets", 
started in (see Chapter || in this report); please see there for the basic definitions. 
The main focus here is on irredundant and minimally unsatisfiable clause-sets, and 
on translations to boolean clause-sets (while the first part focused on the basic 
framework and on notions related to autarkies). 



2.1.1 Translations 

Given that much more is known about boolean clause-sets than about generalised 
clause-sets, and that also basically all SAT solvers only accept boolean clause-sets, 
it is desirable to have a rich toolkit regarding translations from generalised clause- 



sets to boolean clause-sets. We start with the best-known translation in Section 2.3, 
the direct translation, where for every generalised literal {v,e) (meaning "w ^ e"), 
consisting of variable v and value e G , we have a corresponding boolean variable 
T{{v,e)). Since we only have "negative literals", not positive literals "u = e", 
we don't need to enforce that variables get only one value. This weak form of 
the direct translation is in most cases inferior to the strong form (which uses the 
"AMO" -constraints for "at most one value") w.r.t. SAT solving, however regarding 
theoretical purposes it is far easier to handle, since much of the structure of the 
clause-sets is preserved by the translation. For example this translation preserves 



the deficiency, and in Theorem 2.3.5 as an application we can lift fixed-parameter 
tractability in the maximal deficiency from boolean clause-sets to generalised clause- 
sets. 

The direct translation (in the weak form) preserves basic autarky structures, 
however not the conflict structure, given by the conflict multigraph, which has 
as vertices the clauses, and as many edges between vertices (clauses) as they have 
clashing literals, that is, literals with the same underlying variable but with different 



values. In Subsection 2.6.1 we introduce a new translation, the nested translation, 
which preserves the deficiency as well as the conflict structure. For a variable v 
with k = \Dy\ values it uses k — 1 boolean variables, while each value of v is 
represented by one of the clauses of the unique (up to isomorphism) saturated 
minimally unsatisfiable Horn clause-set with A: — 1 variables (and k clauses). This 
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new translation is applied in Subsection 2.6.2, allowing to lift various fundamental 



results regarding the conflict structure of boolean clause-sets to generalised clause- 
sets. 

Finally a very general translation, the generic translation^ is outlined in Sub- 
section 2.7.3. The generic translation covers all known translations, and allows to 



use arbitrary unsatisfiable boolean clause-sets to represent the values of the non- 
boolean variables (and this individually for every variable). The first empirical 
study 1^ using these translations applied the instances of the generic translation 



as listed in Subsection 2.7.3 to the variation of van-der-Waerden problems (recall 
Subsection |1.1.2| in Part I) given by using the first n prime numbers instead of the 
first n natural numbers, computing Green-Tao numbers^ based on the Green- Tao 
Theorem (see The nested translation turned out to be always far superior 

over the direct translation, for all solver types, and only for larger domain sizes the 
logarithmic translation performed even better. 



2.1.2 Irredundant and minimally unsatisfiable clause-sets 

"Irredundant" (generalised) clause-sets F are clause-sets where every clause con- 
tributes some unique falsifying assignment ; if F is also unsatisfiable, then one speaks 
of "minimal unsatisfiability" . In Section 2.4 we provide the basic definitions and 
properties. In a sense the "most" irredundant clause-sets (see Corollary 2.4.8| ) are 
"hitting clause-sets" , where all falsifying assignments are unique for each clause.|^ 
A larger class is given by "multihitting" clause-sets, which are actually in general 
no longer irredundant, but they have exactly one "irredundant core", which can be 
found quickly. 

Minimally unsatisfiable clause-sets are then the subject of Section 2.5. Recall 
that in Part I ( |]60| ) the notion of deficiency has been generalised, and its main 
properties have been established. Now we generalise the characterisation of the 
base-case of deficiency 1 known from the boolean case. 



In Section 2^ we return to the study of hitting clause-sets, now from the point 
of view of the "conflict structure" , that is, considering the (multi-)graph of conflicts 
("clashes") between clauses. The main result is that via the so-called "hermitian 
defect" (obtained by considering quadratic forms, or, equivalently, positive and 
negative eigenvalues) we obtain an upper bound on the deficiency. For certain 
"regular" clause-sets the hermitian defect is easy to compute, and we can generalise 
various characterisations from the boolean case. 



2.1.3 Overview and main results 



In Section 2.3 the so-called direct translation of generalised clause-sets into boolean 
clause-sets is studied under the point of view of structure preservation, taking ad- 
vantage of the fact that due to the restriction to "negative literals" we do not need 
the AMO clauses (incorporating them would destroy the structures the transla- 
tion should preserve). Besides preservation of satisfiability, minimal unsatisfiability 
and leanness, in Subsection 2.3.3 we show that also a good deal of the matching 
structure is preserved by the translation (including for example the deficiency). 
Equipped with these tools, in Theorem 2.3.5 then we obtain FPT for SAT decision 
in the maximal deficiency. 



^) Considering DNF instead of CNF, in the literature on boolean functions boolean hitting clause- 
sets are also known as disjoint DNF. 
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In Section 2.5 wc turn to the study of generalised clause-sets which are min- 
imally unsatisfiable. Considering first the larger class of irredundant generalised 
clause-sets (no clause is implied by the others) in Section |2.4| , we study the ques- 
tion when irredundancy is preserved by applying partial assignments. The class of 
irredundant clause-sets which stay irredundant for all partial assignments is charac- 



terised in Corollary 2.4.S as the class of hitting clause-sets, while in Lemma 2.4.10 



we consider the bigger class of multihitting (generalised) clause-sets and show that 
they ha ve a u nique minimally unsatisfiable core (if they are unsatisfiable). In Sub- 
section 2.5.1 we then discuss the process of "saturation" as introduced in Q; for 
generalised clause-sets we have to face a considerably more complicated situation 
here than in the boolean case, and thus it seems that for generalised clause-sets 
saturation does not play the role it d oes fo r boolean clause-sets. Without the satu- 
ration tool, proving the basic Lemma 2.5.4 for the characterisation of 7WW(1) needs 



a different trick; we use th e goo d properties of the (direct ) boo lean translation. The 
main result of Subsection |2.5.2| then follows in Theorem ^.5.5| (the characterisation 



of minimally unsatisfiable clause-sets of deficiency 1), and its two corollaries (the 
characterisation of saturated and marginal minimally unsatisfiable clause-sets of 
deficiency 1). A short review on properties related to minimal variable occurrences 
given in Subsection 2.5.3 concludes the section. 



Section 2.6 rounds off the picture, by generalising the basic results from linear 
algebra as applied to boolean clause-sets in |4j , regarding the conflict structure 
of clauses. A new translation to boolean clause-sets especially suited for these 



considerations is presented in Subsection 2.6.1, and basic invariance properties are 
shown. Then the basic results on the hermitian rank and on (boolean) hitting 
clause-sets are generalised in Subsection 2.6.2, providing the basic upper bound on 
the deficiency by the hermitian defect in Theorem 2.6.4, while corollaries apply 
these results to regular hitting clause-sets. 

Finally we present a collection of open problems and conjectures in Section 



2.2 Preliminaries 

For the basic noti ons and notations regarding variables, partial assignments and 
graphs see Section |l.2| in Part I, whil e fo r notions and notations regarding (gener- 



alised) (multi-)clause-sets see Section 1.3 in Part I. A quick review of the most basic 
definitions is as follows. 

Variables v have finite and non-empty domains D^. Generalised clause-sets 
denote conjunctive normal forms (CNFs). The literals of generalised clause-sets 
are of the form "v ^ e" for values e G D^, which are as mathematical objects 
defined just as pairs («,£); the disequalities state their meaning, which for DNFs 
(not considered here) would be the equalities "w — e" . Clauses are finite and non- 
tautological sets of literals, that is, they do not contain clashing literals, which are 
literals (w,e), {v^e') with identical underlying variables but different values e ^ e'. 
(Finite) clause-sets are finite sets of clauses (there is no principle problem with 
infinite clause-sets, however they are not considered here). So boolean variables v 
have domains Dy = {0,1}, where "w ^ 0" is the positive literal, and "w ^ 1" the 
negative literal for v. Typical examples for the use of (generalised) clause-sets arise 
from (hyper-)graph colouring problems and from problems from (exact) Ramsey 



theory; see Subsection 1.1.2 in Part I. If clauses might occur several times, then we 
use (finite) multi-clause-sets, which are maps F : CC — > No, where CC is the set of 
all clauses, such that F{C) > is the case only for finitely many clauses C . 
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A partial assignment is a map (p whose domain var((y9) :— dom((/3) is a finite set 
V of variables, such, that for all w e we have (p{v) G Dy. A partial assignments 
(fi satisfies a literal {v,e) iff u G var(iy9) and iy9(u) 7^ £, while 1^9 falsifies {v,e) if 
?; G var(iy9) and (p{v) — e. A partial assignment satisfies a clause iff it satisfies at 
least one literal in it, and it satisfies a clause-sets iff it satisfies all of its clauses. The 
operation of partial assignments ip on clause-sets F is denoted hy ip * F, which is 
the clause-set which results from F by removing all satisfied clauses, and removing 
all falsified literals from the remaining clauses. 

The conflict graph of a clause-set F has the clauses of F as vertices, while two 
(different) clauses are adjacent iff they clash in at least one literal-pair. The conflict 
multigraph has the same vertices as the conflict graph, but now there are as many 
(parallel) edges between clauses as there are conflicts between them. 

Generalising the notion of hitting clause-sets (every pair of different clauses 



clashes in at least one literal-pair; see Subsection 1.3.4 in Part I), a clause-set F 
is called at most fc-multihitting for some fc e No if the conflict graph of F is 
complete fc-partite, while F is called multihitting if it is at most fc-multihitting 
for some fc; let MHXT denote the set of all multihitting clause-sets. While "at 
most fc-multihitting" implies that the chromatic number of the conflict graph is at 
most fc, if we speak of fc-multihitting then the chromatic number of the conflict 
graph must be equal to fc (so that F is hitting iff F is c(F)-multihitting). For a 
given multihitting clause-set F there is a unique partition F of (that is, F is a 
set of sub-clause-sets of F which are non-empty and pairwise disjoint, such that 
their union is F) , so that for any clauses Ci , C2 G F with Ci G Gi and C2 G G2 
for some Gi,G2 G F the clauses Ci and C2 clash if and only if Gi / G2 (so F is 
|F|-multihitting). We call F the multipartition of F (if F is bihitting, then F is 
also called the bipartition of F). 

A basic example class is given as follows. Consider a uniform domain D = 
{!,..., fc} and variables wi,...,-y„, n > 1. Let the clause-set F^ := {{{'^ii^)} ■ 
i G {I, . . . ,n}} for £ G {1, . . . , fc} consist of all the unit-clauses for value e. Then 
F :— lJe=i is fc-multihitting with multipartition {i^^ : e G {i, . . . , fc}}. We can 
add to F all full clauses (containing all variables di, . . . , w„) containing at least two 
values, obtaining a fc"-multihitting clause-set (with fc" -|- (n — l)fc clauses). 



2.3 The direct translation 

In this section we investigate the direct translation of generalised clause-sets into 
boolean clause-sets. Different from previous research (for an overview see f?^ ), here 
we are not interested in experimental results (and how good different translations 
perform in various experiments for different SAT solvers), but we are interest in 
structure-preserving translations. At least regarding our focus on (matching) au- 
tarkies and the deficiency, the only reasonable possibility here amongst the known 
translations seems to be what in ||7^ has been coined the "multivalued encoding" , 
which is the "direct translation" , but without AMO ( "at most one" ) clauses (since 
these binary clauses would destroy the combinatorial structures we are considering) : 

• For every literal {v, s) we consider a boolean variable T[[v,e)) expressing that 
V shall not get value e. 

• Clauses C are translated into (positive) boolean clauses t{C) by replacing 
each literal x G G with the (positive) boolean literal t{x). 

• We add "ALO clauses" requiring that each variable gets at least one value (if 
it gets more than one value, then one of the values can be chosen). 
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In Subsection 2.6.1 we will present a new "nested" translation with some strong 
preservation property, but which conies at the price of breaking the symmetry 
between the possible values of a variable. A very general translation scheme is 



discussed in Subsection 2.7.3| , covering all known translations. 



In in Subsection 4.5 ("An autarky preserving reduction to boolean clause- 
sets") it has already been stated that the direct translation not only preserves 
satisfiability, unsatisfiability and minimally unsatisfiability, but also leanness. We 
have to expand these results especially regarding the notions of matching autarkies 
and deficiency, since in U& only a restricted notion of "matching autarkies" has 



been used (recall Subsection 1.9.4 from Part I), without an associated notion of 
deficiency 

Another source relevant here is Q , where "monosigned CNF formulas" are trans- 
lated, a generalisation of "generalised clause-sets" allowing also to express that a 
variable must get a certain value; in other words, where our literals (w,e) express 
"t! e" , for monosigned formulas also "positive" literals "u = e" are allowed. We 
have discussed this and further generalisations in Subsection |l.l.3| in Part I, and we 
quickly recall the relevant points here. 



• The generalisation to monosigned CNF formulas can be motivated by the fact 
that these formulas are exactly those which can be translated by the direct 
translation; however the price which has to be paid here is that now the AMO 
clauses are necessary in the direct translation! This adds further to the point 
we want to make, that generalised clause-sets in our definition (allowing only 
"negative literals") are the appropriate generalisation of boolean conjunctive 
normal forms, while further generalisations (like "monosigned formulas" ) enter 
new areas, where the combinatorics of clause-sets no longer can be applied. For 
a local search algorithm working directly with "monosigned CNF formulas" 
see (using the notion of "nb- formulas" (for "non-boolean")). 

• It is worth to mention here that in fz^ it has been shown that resolution which 
works only with generalised clause-sets, that is, where in the corresponding 
branching approach for a variable v only a branching of width \D^\ assigning 
in each branch one of the possible values to v (see [0) is considered, can 
be exponentially worse than resolution on the translation into boolean logic, 
where now branchings "w gets value e" and "u does not get value e" are 
possible. From this is follows that generalised DPLL-algorithms should not 
be restricted to branchings where in each branch a variable needs to be fixed 
to some value; however the focus of this article is not generalisation of SAT 
solvers, but generalisation of combinatorial structure, and thus we do not 
further pursue these (important) investigations. 



2.3.1 The details of the translation 

Formally, the translation proceeds as follows. We consider some bijection t : 
CIT — > V^^o.i} from the set of all (generalised) literals to the set of all boolean 
variables.^ The intended meaning of the (positive) boolean literal r((u,e)) for a 
literal (w, e) G CXT is the same as the interpretation of the original (generalised) lit- 
eral, namely "w shall not get value e" . We obtain an injection t : CC — > CC{VA{o,i}) 
by setting r(C) := {t(x) : x G C} for C G CC. Actually t : CC ^ CC{VA{o^i}) 
constitutes a bijection from CC to the set of all positive boolean clauses. The 

^'Such a bijection exists due to our assumption on VA, since the set of all literals has the same 
cardinality as the set of variables, as it is well known from elementary set theory. 
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translation t can be further extended to an injection r : CCS — )■ CCSiVA^Q ^) by 
r(F) := {r(C) : C G F} for e C/:5. Again, r : C/:5 ^ CCS{VA{o,i}) constitutes 
a bijection from the set of (generahsed) clause-sets to the set of boolean clause-sets 
containing only positive clauses. Finally, for v G VA let 

ALO„ {T((v,e)) : e G G C/:(V^{o,i}) 

be the (negative, non-empty boolean) clause expressing that v gets assigned at least 
one of the values e G Dy (that is, not all (positive) literals r((ti,e)) for e G Dy can 
be true), and let the full translation <d : CCS CCS{yA{o^i^) (which again is an 
injection) by given as 

0(F) := t{F) U {ALO^, : v G var(F)}. 

Note that the union in the definition of 9(F) is disjoint, since r(F) consists only 
of positive clauses, while {ALO„ : v G var(F)} consists only of non-empty negative 
clauses (and thus 8(F) is a "PN-clause-set" as defined in [|^). As an example, 
consider F = { {w 7^ 0, w ^ 1}, {w ^ 1, w 7^ 0}, {w ^ 2, w ^ 2} } for variables 
v,w with Dy — Dy, ~ {0, 1,2}. Now, using :— T((w,e)) and :— t{{w,£)) for 
e G {0, 1, 2} (so altogether we get six boolean variables here), we have 

'^(P) = { {ao,bi},Wi,bQ},{a2,b2},{a^,aT,a^},{bo,bi,b2} }• 

In gene ral the sub-clause-sets of 9(F) not containing pure variables (recall Sub- 
section |l.6| from Part I) are exactly the 9(F') for F' C F not containing pure 
variables. 



2.3.2 Preservation of general structure 

Regarding set-theoretical operations we have that 9 is an embedding of the semi- 
lattice iCCS,U) into {CCS{VA{o,i}),Ll), that is, for Fi,F2 G CCS we have 

9(FiUF2) = 9(Fi)u9(F2). 

Thus 9 is also an order embedding, i.e., Fi C F2 <^ 9(Fi) C 9(F2). By definition 
we have for F G CCS the equalities 

c(9(F)) = c(F) + n(F) 
n(9(F)) = J2 l^-l 

(5(9(F)) = c(9(F))-n(9(F)) = c(F)-wn(F) =(5(F), 

and thus the translation 9 preserves the deficiency of clause-sets as defined in 
Subsection |17.1| (Part I). It follows immediately that (5* (9 (F)) > S*{F) holds for 
all F G CCS, but inequality can occur here (see Subsection 2.3.31 ). 



We consider now the relations between partial assignments (p G TASS for 
F G CCS and partial assignments ip G rASS{VA{o^i}) for 9(F) G CCS{VA{o^i}). 
For (p G TASS we define the partial assignment r(c^) G TASS{VA^o^ij) by letting 
var(r((p)) := {T{{v,e)) : v G var((^),e G Dy} be the set of all boolean variables 
corresponding via the translation to literals over the variables in vai(if), while 
T{ip){(v,e)) = iff ip{v) — £. If we consider for example the partial assignment 
(w — >■ l,w ^ 2) for variables v,w with Dy — Dyj — {0, 1,2}, then, using as above 
ae := t{{v, e)) and :— t{{w, e j) for e G {0, 1, 2}, we get 

t{{v ^ 1,w ^ 2)) ^ (ao ^ 1, ai ^- 0, 02 1, 60 -> 1, 62 -> 1, 62 -> 0). 
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The partial assignments in 724iSiS(Vl4{o,i}) of the form t{lp) for some e VASS 
are called standard partial assignments (w.r.t. r). So r constitutes a bijection 
between VASS and the standard partial assignments (which are always boolean), 
and standard partial assignments Lp e VASSiyA^Q ^) are characterised by the 
condition, that whenever some T{{v,e)) G var((/5), then for all e' G we have 
t((w,£')) e var(iy9), and there is exactly one eo G Dy with iy9(r((w, Eq))) = 0; for the 
corresponding partial assignment T^^{if) G VASS we then have T~^{ip)(v) = Eq. 

In the following lemma we see that the properties of regarding touching or 
satisfying clauses are well reflected by t{lp), and hence the translation is invariant 
regarding the autarky property and the property of satisfying a clause-set. 

Lemma 2.3.1 For Lp e VASS, C e CC and F e CCS we have 

1. ip touches resp. satisfies C if and only if T{ip) touches resp. satisfies t{C). 
Thus 

T(-Fvar((^)) = T(i^)var(T(vp)) 

e(F[var(95)]) = e(i^)[var(T(^))]. 

2. t{lp) is an autarky for the set of clauses {ALOi, : v G V14}. 

3. p is an autarky for F if and only if t{lp) is an autarky for Q{F). 

4- If t{lp) satisfies Q{F), then Lp satisfies F. If on the other hand Lp satisfies F 
and var((/3) 3 var(i^) holds, then t{p) satisfies Q{F). 

Proof Parts ^ || follow directly from the definitions, while Part ^ follows from 
Parts |l|, |, and Part | follows from Parts |l|, |. I 

For the reverse direction, translating partial assignments in TASS{VA^o^ij) to 
partial assignments in VASS, call p E JASS{VA[o,i}) admissible if ip is an autarky 
for the set of clauses {ALO„ : v £ VA}, that is, if T{{v,e)) G va,i{(p), then there is 
eo G Dv with (p{t{{v, Eq))) = 0. In words: a partial assignment ip for the boolean 
variables is admissible iff for every variable r ((«,£)) in its domain there exists a 
value £o G Dy such that T{(v,eo)) is in the domain of ip as well with ip{T{{v, Eq))) = 
0. Note that an autarky ip G TASS{VA{o^i}) for 6(F) (this includes satisfying 
assignments) is admissible in case oi va.r{ip) C var(8(F)). 

Call a standard partial assignment t/j G VASS {VA{o,i}) a standard comple- 
tion of an admissible ip G 72455 (V^{o,i}) if V' touches (satisfies) exactly the same 
ALO-clauses as ip, and if from ?/;(t((i;, e))) — always follows ip{T{{v,e))) — 0. 
In other words, a standard completion ip of an admissible ip is obtained from ip 
by considering all variables v such that e E Dy with r((w,e)) G var((p) exists, 
choosing eo{v) G Dy with ip{T{{v, eo{v)))) = 0, and setting 'i(j{t{{v, s' j)) := 1 for 
s' eDy\ {eoiv)}, while iP{t{{v, eo{v)))) := 0. 

The standard completions of admissible partial assignments in TASS{VA{o^i}) 
are exactly the t{9) for 9 G VASS. The purpose of standard completions ip of 
admissible partial assignments ip is to transform as follows an (arbitrary) autarky 
ip for Q{F) into an autarky ip for Q{F) which is also a standard partial assignment: 

1. First we restrict ip to var(0(F)) to be sure it is admissible. 

2. Then we choose a standard completion ip. 



Now by Lemma 2.3.1 (Part m we obtain from ip an autarky for F. The following 



lemma (with obvious proofs) states the basic properties of standard completions. 
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Lemma 2.3.2 For C G CC and F £ CCS, an admissible tp G 73455(Vl4{o,i}) a^*^ 
a standard completion G VASSiVA^Q ^) of Lp we have 

1. If touches resp. satisfies t{C) then ip touches resp. satisfies t{C). 

2. If If is an autarky for 0{F) then -ijj is an autarky for 0{F). 

Lemma 2.3.3 For a (generalised) clause-set F G CCS we have: 

1. F e SAT ^ e(F) G SAT. 

2. F e MU <^ e{F) G MU. 

3. Fe CEAJV ^ e(F) G CEAAf. 



Proof If F G SAT then e(F) G SAT with Lemma |2.3.l| , PartW, and if e(F) G 
SAT, then F G SAT with Lemma j^U, Part and Lemma Part |. 



li F ^ A4U, but there were some minimally unsatisfiable F* C 0(F), then there 
would be F' C F with 6(F') = F* (since F* does not contain pure variables), and 
thus F' would be unsatisfiable by Part |^. If on the other hand 6(F) G AiU, but 
there were some unsatisfiable F' C F, then 8(F') would be unsatisfiable as well by 
Part I. 



Finally, if F G CSAAf then e(F) G CSAAf by Lemma Part p| and Lemma 

Oil , Part |, and if e(F) G C£AJ\f then F G C£AJ\f by Lemma |2.3.l| . Part | (the 



other direction). 

Parts |l| and ^ have been concluded in Corollary 20 in |Q from the stronger 
property Na(6(F)) = 0(Na(F)) (recall that Na is the lean kernel operator); in this 
article we do not go further with the study of the translation Q, but we restrict 
ourselves to the minimum required to understand our applications. 

2.3.3 Preservation of matching structure 
Lemma 2.3.4 For tp G VASS, C eCC and F e CCS we have 

1. If t{p) matching satisfies 0{F), then ip matching satisfies F. 

2. If t{p) is a matching autarky for 6(F), then p is a matching autarky for F . 

Proof If the partial assignment t{p) matching satisfies 8(F), then (by definition) 
for each clause D G 9(F) one can choose a literal xd ^ D with p{xd) = 1, such 
that for the variables var(a;£)) = t{{vd,£d)) the map D G 6(F) n> t{{vd,£d)) is 
injective (whence D G 6(F) i-> {vd,£d) is injective). Now the map C ^ F i—^ '^t(c) 
has for each image w^(c) ^t most (|F't,| — l)-many inverse images, since for each 
£ G there is at most one D G 6(F) with vd = ^t{c) ^-nd e^) = e, and exactly 
one of these D is the clause ALO^^ . 

For Part || recall that ^ is a matching autarky for F iS p matching satisfies 
F[var(iy9)], which by P art |l| follows from t{p) matching satisfying 6(F[var(i^)]), 
where by Lemma 2.3.1 , Part |l| we have Q{F[va.T{p)]) = 6(F)[var(r(iy9))], and thus 



the latter assertion is equivalent to t{p) being a matching autarky for 6(F). 

Lemma 19, Part (l)(d) of |Q rephrased in the terminology of Subsection 1.9.4 
(Part I) says that if (/? is a non-repetitive matching autarky for F, then t(p) is a 
matching autarky for 6(F); in follows then in Corollary 20 of that if 6(F) 
is matching lean, then F is lean w.r.t. non-repetitive matching autarkies. These 
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properties do not hold for raatching autarkies in general (in the presence of non- 
boolean variables), as the following examples show. 

An example, where a matching autarky ^p for a (generalised) clause-set F G CCS 
does not yield a matching autarky r(</j) for Q{F), is given by mwte-clause-sets as 
follows: 

1. Consider the multi-clause-set := 2 • {(t;,0)} for a variable v with Dy = 
{0,1,2}. 

2. Fx is matching satisfiable (via {v — )• 1) or {v 2)). 

3. F\ is lean w.r.t. non-repetitive matching autarkies. 

4. The translation is Q{Fi) = 2-{t{{v, 0))}+{r((t;, 0)), t{{v,1)), r((t;,2))}, where 
var(e(i^i)) = {t{{v,Q)),t{{v,1)),t{{v,2))}. 

5. Nnia(6(-Fi)) — 2 • {r((u, 0))} = t{Fi) (via matching autarkies we can only 
eliminate the ALO-clause). 

6. Thus 0(Fi) is not matching satisfiable. 

One sees that the problem with transferring matching autarkies from generalised 
(multi-)clause-sets to their boolean translation lies in the possibility that a matching 
in the clause-variable graph B{F) might use the same literal several times, which 
is not possible for the translated literals. To obtain an example using clause-sefs, 
consider additionally two boolean variables w, w' and let 

F2 = {{v^Q,WTt 0}, {v 7^ 0, 0}, {w 1}, {w' 1} }. 

The partial assignment <^ := — >■ 1, w — >■ 0, w' — >■ 0) is matching satisfying for F^ 

(note that again F2 is lean w.r.t. non-rcpctitivc matching autarkies), but t{}p) is 
not a matching autarky for 0(i^2), and moreover the matching lean kernel of 6(^2) 
is 6(^2) \ {ALO„} (again only the ALO-clause for v can be eliminated via matching 
autarkies), and thus 8(^2) is not matching satisfiable. Furthermore we have in this 
example (5*(F2) = and 5*(e(F2)) = 5(e(F2) \ {ALO^) = (5(9(^2)) - (1 - 2) = 
^(Fs) + 1 = 1. 

Now consider the transfer of matching autarkies in the other direction, that is, 
wc have given a matching autarky 99 G 7J4iS5(V^{o.i}) for 0(F), and wc want to 
obtain some associated matching autarky for F. The problem here is that might 
use some variable r((w, e)), but not a variable r((7;,e')) for some e' € \ {e}, and 
such situations can not be translated back to F. The simplest example for this 
phenomenon is (again) given by a ?nu/ti-clause-set as follows: 

1. Let F3 := {(v, 1)} + 2 • {(v, 2)} for a variable v with £)„ = {0, 1, 2}. 

2. Obviously F3 is matching lean. 

3. The translation is 

e(F3) = {r((^;, 1))} + 2 • {t{{v, 2))} + {7(^0)), 7(M)), ^(M)}, 

where var(e(F3)) = {t((«, 0)), t((«, 1)), t((«, 2))}. 

4. Thus Nma(0(F3)) = r(2 • {(y,2)Y) via the matching autarky (t((?;,0)) 
0,t((z;,1))^1). 
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A clause-sei -Flj, where F4 is matching lean but 0(i^4) is not, is given by 

:={{v^ 1}, {v ^ 2}, {v^2,w^ 0}, {w^l}} 
for an additional boolean variable since here 

N„,a(e(i^4)) = t{{{v ^ 2}, {v^2,w^ 0}, {w ^ 1}}) U {ALO„} 
via the matching autarky (t((w,0)) — > 0, r((w,l)) — > 1) for 6(i^4). 

2.3.4 Polynomial time SAT decision in the maximal defi- 
ciency 

As we have seen now, matching autarkies for (generalised) clause-sets F G CCS 
and matching autarkies for Q{F) G CCS{VA[o,i}) in general are incomparable. 
Nevertheless we can use them to show fixed-parameter tractability for generalised 
clause-sets w.r.t. the parameter S*{F) as follows. 

Theorem 2.3.5 SAT decision for (generalised) clause-sets F G CCS can be done 
tn t«meO(2^-*(^)-(E„(F)l^«l)') 

Proof Consider F G CCS and let F* be the result of reducing Q{F) w.r.t. matching 
autarkies and pure autarkies (thus F* is the unique maximal sub-clause-set of F 
which is matching lean and does not contain pure variables). We can compute F* in 
polynomial time, and F* is satisfiability equivalent to F. Since F* contains no pure 
literals, it corresponds to a sub-clause-set of F, and thus we have 6{F*) < S*{F), 
and since F* is matching lean we have S*{F*) — 6{F*). Theorem 4 in Q says 
that satisfiability of F* can be tested in time 0{2^'^^''^ ■ n(F*)^), where in this 
procedure actually already the cost of reducing Q{F) to F* is included if we use 
n{Q{F)) instead of n{F*) in the big- Oh expression (see Section 5 in or use the 



argumentation of Subsection 1.11 of Part I), and the theorem follows. 



2.4 Irredundant generalised clause-sets 

One of the motivations behind the notion of lean clause-sets can be seen in "ap- 
proximating" the fundamental notion of minimally unsatisfiable clause-sets. In this 
section we will now consider some of the basic facts on the more general notion 
of "irredundant clause-sets" (that is, every clause contributes something unique, 
allowing also to consider satisfiable clause-sets) in our generalised setting. Detailed 
studies of irredundant clause-sets in the boolean case can be found in the following 
references: 

1. (speaking of "clause minimal formula" ) focuses on questions related to the 
problem (from a complexity theoretical perspective) when for given clause-sets 
F, H there exists a clause-set G such that F U G is equivalent to H. 

2. considers in various forms (also mostly from a complexity-theoretical per- 
spective) the problem of finding an irredundant core in a given clause-set. 



We start in Subsection 2.4.1 with a discussion of the notion of "irredundant 



clause-sets" , concentrating on the basic question of preservation of irredundancy un- 



der application of partial assignments. In Subsection 2.4.2 we consider the in some 



sense most extreme case of irredundant clause-sets, namely "hitting clause-sets" : 
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every two different clauses clash, that is, have no common falsifying assignment, or, 
in other words, the conflict graph is complete. Furthermore we consider the nat- 
ural generalisation to "multihitting clause-sets" (the conflict graph is multipartite; 
see Section p^ ). In Corollary 2.4.8 we show that hitting clause-sets are exactly 
those clause-sets which are irredundant after application of every partial assign- 
ment, and thus unsatisfiable hitting clause-sets are exactly those clause-sets which 
are minimally unsatisfiable after application of every partial assignment (Corollary 
2.4.S). For unsatisfiable multihitting clause-sets we show in Lemma [2.4.10 that 
they have exactly one minimally unsatisfiable sub-clause-set (which can be com- 
puted efficiently by subsumption-elimination) , and in Lemma 2.4.12 we show that 
the satisfiability problem for bihitting clause-sets (where the confiict graph is bi- 
partite) is solvable in quasi-polynomial time (this problem is essentially the same 



problem as the hypergraph transversal problem). We conclude in Subsection 2.4.3 



by considering "irredundant cores" , that is, minimal equivalent sub-clause-sets, in 
general. The main result here is Lemma 2.4.13, expressing the duality between 
irredundant cores and maximal non-equivalent sub-clause-sets via the formation of 
hypergraph transversals. 

The special case of unsatisfiable irredundant clause-sets (i.e., minimally unsat- 



isfiable clause-sets) is considered in Chapter 2.5 



2.4.1 Irredundant clause-sets 

A clause C G F is called redundant (or unnecessary) for clause-set F £ CCS if 
P \ {^} H ^ holds, while otherwise C is called irredundant (or necessary) for 
F. The following conditions are equivalent for a clause C £ F: 

• C is redundant for F. 

• F \ {C} is equivalent to F. 

• The set falvar(F)(C') of falsifying assignments for C (w.r.t. the variables of F) is 
covered by the family (falvar(_F)(C"))c"gF\{c} of sets of falsifying assignments 
for the remaining clauses. 

A (generalised) clause-set F e CCS is called irredundant if all C G F are ir- 
redundant for F, otherwise F is called redundant. A clause-set F is minimally 
unsatisfiable if and only if F is unsatisfiable and irredundant. Regarding complexity 
classifications of decision problems related to (ir)redundancy we have the following: 

1. In it is shown that the decision problem whether a (boolean) clause-set is 
irredundant is NP-complete, while the decision problem whether a (boolean) 
clause-set is minimally unsatisfiable is _D^-complete. Trivially these results 
also hold for generalised clause-sets. 



As we have seen in Theorem 2.3.4 SAT decision for (generalised) clause-sets 



is fixed-parameter tractable in the maximal deficiency, and thus also irredun- 
dancy decision is fixed-parameter tractable in the maximal deficiency. Since 
for minimally unsatisfiable (generalised) clause-sets maximal deficiency and 
deficiency coincide (Corollary [1.9. 9| in Part I), minimally unsatisfiability deci- 
sion is also fixed-parameter tractable in the deficiency; however, as shown in 
Proposition 1 in [Q, the decision whether a (boolean) clause-set is irredun- 
dant with deficiency k is NP-complete for every fixed k G N (different from 
minimally unsatisfiable clause-sets, irredundant clause-sets of deficiency k can 
contain sub-clause-sets of arbitrary deficiency) . Obviously the same holds for 
generahsed clause-sets. 
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We are interested here in the question, given a partial assignment (p and a clause 
C & F with If * {C} 7^ T (i.e., C is not satisfied by ip), under what circumstances 
is the clause (yS * C = C \ redundant for ip * F 1 We will see that this question 
is closely related to the question, how "much irredundant" C is for F, that is, how 
much of falvar(F)(C') is covered by (falvar(F)(C"))c'eF\{c}i which can be recast as 
the question, whether for some C" ^ C we have F \ {C} \= C . 

Assume that iy9*C is redundant for tp^F, that is, {p*F)\{(p-'f{C}) \= ip*C holds. 
Due to {ip*F)\{<p* {C}) Cip*{F\ {C}) it follows (p*{F\ {C}) \=^*C, which is 
equivalent to F\{C} \= CUC^. Let us call C tp-redundant for F if F\{C} |= CU 
Cip holds, and otherwise (^-irredundant. In other words, C is iy9-redundant for F iff 
the part of falvar(F)(C') which consists of assignments compatible with ip is covered 
by (falvar(_F)(C'))ce-F\{c}- Obviously, C is redundant for F iff C is 0-redundant for 
F, and if C is (^-redundant for F, then C is also (^'-redundant for F for every partial 
assignment ip' with ip <Z ip' . So (/?-redundancy generalises (ordinary) redundancy 
by weakening it, while (/?-irredundancy strengthens (ordinary) redundancy. For 
an example consider boolean variables a, b and the irredundant clause-set F = 
{{a}, {b}}: {b} is (a 0)-redundant for F. 

If C is (p-irredundant for F, then (p*C is irredundant for (p* F, but the reverse 
direction is not true in general due to the fact that there might be other clauses 
C £ F with p * C — p * C. To repair this, let us call clause C contraction-yj- 
redundant for F if 

F\{C' eF:p* {C} = ip * {C}} h C U C^, 

while otherwise we call C contraction-y-irredundant for F. We summarise (and 



extend) the foregoing discussion in Lemma 2.4.1, whose proof should be obvious by 
now. 

Lemma 2.4.1 Consider a generalised clause-set F G CCS, a clause C £ F and a 
partial assignment ip £ VASS such that ip * {C} ^ T . 

1. ip*C is (ir ) redundant for p* F if and only if C is contraction-ip-(ir)redundant 
for F. 

2. (a) If C is ip-irredundant for F , then C is contraction-p -irredundant for F . 

(h) If there is no clause C Cz F \ {C} with p * {C} = ip * {C} (that is, C is 
"contraction- free" in F w.r.t. ip), then also the reverse direction holds, 
that is, if C is contraction-ip -irredundant for F then C is p-irredundant 
for F . Clause C is contraction-free in F w.r.t. ip in the following cases: 

(i) n{ip) =0 (i.e., ip is the empty partial assignment) ; 
(a) n{p) = 1 and F is subsumption-free; 
(Hi) C clashes with every C G F \ {C}. 

Corollary 2.4.2 Consider a generalised clause-set F G CCS which is subsumption- 
free, a clause C G F and a variable v G VA together with a value e G Dy such that 
for all e' G \ {e} we have {v,e') ^ C. Then {v ^ e) * C = C \ {{v,e)} is 
irredundant for (v e) * F if and only if C is {v — > e) -irredundant for F, that is, 
^ff F \ {C*} ^ CU {(w,e)}. In other words, a non-satisfied clause is irredundant 
after application of an elementary partial assignment (for the new clause-set) iff the 
original clause is irredundant (for the original clause- set) even after addition of the 
falsified literal. 

Obviously irredundant clause-sets are subsumption-free, and from Corollary 



2.4.2 we get immediately: 
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Corollary 2.4.3 Consider an irredundant generalised clause-set F £ CCS, a clause 
C £ F and a variable v G VA together with e G Dy . 

1. If there exists e' € Dy \ {s} with {v,e') G C, then clause C vanishes when 
applying {v e) to F (and in that sense it becomes redundant in {v — >■ e) ). So 
assume valt,({C}) C {e} in the sequel (that is, C is not satisfied by (v — > s) ). 

2. If (w, e) G C , then {v ^ e) * C — C \ {{v, e)} is irredundant for (u — > e) * F. 

3. If {v,e) ^ C, then C is irredundant for {v e) * F if and only if C is 
{v ^ e) -irredundant for F , i.e., iff F\{C} ^ C U {(i;, e)}. 

In other words, considering the application of an elementary partial assignment to 
an irredundant clause-set, new clauses (produced by the application) are definitely 
irredundant, while untouched clauses stay irredundant iff they stay irredundant in 
the original clause-set after addition of the falsified literal (so that they fall then 
actually under the first case). 

Considering a clause C G F, we called C redundant for F iS F\ {C} \= C; now 
for arbitrary clauses C we can call C "dependent" on F if F |= C holds (that is, 
if the set of falsifying assignments of F covers the set of falsifying assignments of 
C), and otherwise "independent". If C G F, then C is dependent on F, while C 
is redundant for F iff C is dependent on F \ {C}. The relation of C depending on 
F allows two dimensions for minimisation: Considering a minimal clause C which 
is dependent on F we arrive at the notion of a prime implicate of F (an implied 
clause, which ceases to be implied after removal of any literal), while considering a 
minimal clause-set F such that C depends on F we arrive at a minimal premise set 
for C. The following lemma states the relation between minimal premise sets and 
minimally unsatisfiable clause-sets. 

Lemma 2.4.4 Consider a generalised clause-set F G CCS and a clause C G CC. 
Then the following assertions are equivalent: 

1. F is a minimal premise set for C . 

2. ipc * F is minimally unsatisfiable, no clause of F is satisfied by Lpc, and F 
is ipc -contraction free, that is, there are no clauses D,D' G F, D ^ D' , with 
ipc * {D} = ipc* {D'}. 

Corollary 2.4.5 // clauses C, D are prime implicates of the generalised clause-set 
F , and if F is a minimal premise set for C as well as for D, then C = D holds. 

Proof Assume w.l.o.g. that there is a literal x G C\D. Since ipc does not satisfy 
any clause of F, while C is minimal, the literal x occurs in F, and all occurrences 
of var(a;) in F are equal to x (while |£'var(x)| > 2). Furthermore var(x) ^ var(Z?) 
(since ipo does not satisfy any clause of F), and then ip^ * F contains the pure 
variable var(a;) and thus can not be unsatisfiable. I 

Even an irredundant clause-set F may not be a minimal premise set of any clause 
C (consider for example the boolean clause-set {{a}, {6}}), and for a prime implicate 
C of F there might be several minimal premise sub-sets of F (consider for example 
C = jg, b } and the boolean clause-set {{a,x},{x,b},{a,y},{y,b}}). However by 



Corollary p.4.5| for a clause-set F there can be at most one prime implicate C such 
that F is a minimal premise set of C, and thus we get (by considering all non-empty 
sub-clause-sets): 
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Corollary 2.4.6 A generalised clause-set F can have at most 2'^^^^ ^ many prime 
implicates. 



See the end of Subsection 2.6.2 for some comments on the sharpness of this bomid. 



2.4.2 Hitting and multihitting clause-sets 

The next lemma answers the question which clauses C remain irredundant for a 
clause-set F under all applications of partial assignments; this strongest form of 
irredundancy of C for F turns out to be equivalent to the condition that the set 
of falsifying assignments for C is not covered at all by (falvar(F) (C'))c'ei='\{c}- A 
simple but important observation here is that for two clauses C, C" and var(C) U 
var(C") C y we have faly(C) n falv(C") = iff C and C" clash. 

Lemma 2.4.7 Consider a generalised clause-set F G CCS and a clause C G CC. 
Then the following assertions are equivalent: 

(i) C is if -irredundant for F for all ip G TiASS. 
(a) C is contraction-ip -irredundant for F for all ip G TiASS. 

(Hi) falvar(F)(C) n \Jc'eF\{C} falvar(F)(C") = 0. 

(iv) C clashes with every C Cz F \ {C}, i.e., clause C is connected in the conflict 
graph cg{F U {C}) to every other vertex. 

Proof By the above remark we see that (|i]|) a nd ([^are equivalent. By definition 



( |iii[ ) is equivalent to (|i|), while by Lemma 2.4.1, part ^it is equivalent to (^j 



We remind at the notion of a "hitting clause-set" (see Subsection [1.3.4 in Part 



I), where every two different clauses clash in at least one variable (have literals 
with the same variables but with different values). This is equivalent to the sets of 
falsifying assignments for different clauses being disjoint. 

Corollary 2.4.8 A generalised clause-set F G CCS is a hitting clause-set if and 
only if for all if G VASS the clause-set ip * F is irredundant. 

Obviously the application of a partial assignment to a hitting clause-set produces 
again a hitting clause-set. So we also have the simple fact, that F is hitting iff for 
all partial assignments ip * F is hitting. Generalising Theorem 32 in ||49| : 

Corollary 2.4.9 A generalised clause-set F G CCS is unsatisfiable hitting if and 
only if p * F is minimally unsatisfiable for every p G VASS . 

Hitting clause-sets are irredundant; the more general class of multihitting clause- 
sets (clause-sets with complete multipartite conflict graph) contains redundant 
clause-sets, but all redundancies can be removed efficiently (and canonically) , as 
the following lemma shows. We use the notion of an irredundant core of a 
clause-set F G CCS which is an irredundant F' C F such that F' is equivalent to F 
(in | |6^ the notion "irredundant equivalent subset" is used). An irredundant core 
of an unsatisfiable clause-set is called a minimally unsatisfiable core (or simply 
"minimally unsatisfiable sub-(clause)-set" , often abbreviated by "mus"). 

Lemma 2.4.10 Consider a generalised clause-set F G CCS without trivial vari- 
ables which is multihitting. Let ¥ be the multipartition of F, and V :— var(F). 
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1. For Fi, F2 e F, Fi ^ F2 we have faly(Fi) n falv(F2) = 0. 

2. If for F' C_ F and C E F \ F' we have F' |= C , then there must be some 

C e F' with C c C. 

3. F has exactly one irredundant core, which is obtained from F by suhsumption- 
elimination. Thus if F is unsatisfiable, then F has exactly one minimally 
unsatisfiable core, which is obtained from F by sub sumption- elimination. 

4-. A hitting clause-set F is unsatisfiable iff '^ceF\^^^yi{^})\ ~ \T^SS{V)\. 

Proof Part |l] follows by definition. In Part ^ it is falydC}) covered by faly(F'), 
and thus by Part ^ in fact falydC}) is covered by falv(F' n Fc), where Fc £ F 
with C e Fc; i.e., FcCiF' ^ {C}. By the strong completeness of resolution and the 
fact that within Fc no clashes exist, it follows that there must be C € F' DFc with 
C C C. Part ^ follows immediately from Part ||. Finally Part ^ follows immediately 
from Part |l]. I 

Corollary 2.4.11 A multihitting clause-set is irredundant if and only if F is sub- 
sumption-free. Thus an unsatisfiable multihitting clause-set is minimally unsatisfi- 
able if and only if F is sub sumption- free. 



By Corollary |2.4.11| we know that deciding whether a multihitting clause-set is 
minimally unsatisfiable is the same task (up to subsumption elimination) as deciding 
whether it is unsatisfiable. Obviously AdHITDUSAT is in co-NP (and thus also 
AiTLTT n MU). We have more precise information only for special cases: 

• Using |falvar(F)(C)| = nt,evar(F)\var(c)l-Oi'l ^o^' C* ^ F it follows that satisfia- 
bility for hitting clause-sets is decidable in polynomial time (generalising the 
well-known special case for boolean clause-sets). 

• For boolean bihitting clause-sets in |Q it was shown that satisfiability decision 
can be done in quasi-polynomial time (where "quasi-polynomial" means a 
"polynomial" upper bound but where the exponent is allowed logarithmic 
growth in the size of the input), since satisfiability decision for bihitting clause- 
sets is essentially the same as deciding whether for two given hypergraphs one 
is the transversal hypergraph of the other. 

The second point can immediately be generalised as follows. 

Lemma 2.4.12 Satisfiability for bihitting generalised clause-sets is decidable in 
quasi-polynomial time. 

Proof Variables with a domain size greater than two appearing in a bihitting 
clause-set must be pure variables, since if a generalised clause-set contains a variable 
of domain size k, then the conflict graph contains the complete graph Kk (which is 
not bipartite). I 

It seems to be a very interesting question, to what degree (generalised) multihit- 
ting clause-sets have efficient satisfiability decision (see Subsection 2.7.2| for further 
discussion, and see pGl for more information in the boolean case). 
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2.4.3 Ir redundant cores 



A "core" of an unsatisfiable (boolean) clause-set is just some unsatisfiable sub- 
clause-set; typically one is interested in minimally unsatisfiable cores, and in this 
short section we generalise some basic observations in [|4| from the boolean context 
to generalised clause-sets, and now also considering satisfiable clause-sets. 

Recall that a (finite) hypergraph is a pair (V, E) such that is a (finite) set 
(the "vertex set") and E is a set of subsets of V (the set of "hyperedges" ) . For 
a hypergraph G by min(G) resp. max(G) we denote the hypergraph with the 
same vertex set and with all inclusion-minimal resp. maximal hyperedges from G. 
Consider a (generalised) clause-set F. Let EQ(F) be the hypergraph with ver- 
tex set F (the clauses of F), while the hyperedges are all subsets of F which are 
equivalent to F, and let NEQ(i^) be the hypergraph with vertex set F and hyper- 
edges the subsets of F which are not equivalent to F. If F is unsatisfiable, then 
EQ(i^) = USAT{F), the hypergraph consisting of all unsatisfiable sub-cause-sets 
of F, while NEQ(F) = SAT{F), the hypergraph of all satisfiable sub-clause-sets 
of F. Now min(EQ(F)) is the hypergraph consisting of all irredundant cores of F\ 
if F is unsatisfiable then min(EQ(i^)) = MU(i^), the hypergraph of all minimally 
unsatisfiable cores of F. 

Generalising |6|, the elements of p|EQ(F) p| min(EQ(i^)), the clauses 
which are in every irredundant core of F, are called necessary clauses. Following 
[|6^ , the elements of IJ min(EQ(F)), the clauses which are in some irredundant core, 
are called potentially necessary clauses (in |]68| such clauses are called "useful"). 
We see that necessary clauses are exactly the irredundant clauses as defined before. 
Regarding decision complexity we have: 

1. A clause-set F is satisfiable iff ± is necessary for F U {J-}, and thus already 
for (boolean) unsatisfiable clause-sets decision whether a clause is necessary 
is NP-complete (this was noticed for (arbitrary) boolean clause-sets in Theo- 
rem 3 in |]68| , and trivially also the decision problem whether some clause is 
necessary for an generalised clause-sets is NP-complete as well) . 

2. By Theorem 4 in we have that decision whether a clause G is potentially 
necessary for a (boolean) clause-set F is E2 -complete (where is the class 
of problems reducible to the decision problem whether a quantified boolean 
formula with quantifier-prefix E1*V* is true). Trivially this holds also for all gen- 
eralised clause-sets, and due to IJ min(EQ(F U {L})) = {_L} U |J min(EQ(F)) 
we can restrict F here again to unsatisfiable clause-sets. 

3. In Theorem 5 in it is shown that decision whether a (boolean) clause- 
set has a unique irredundant core is [logn]-complete (where A2 is the 
class of problems decidable in polynomial time by arbitrary use of an NP- 
oracle, while for A2 [logn] only logarithmically many oracle calls are allowed). 
Obviously this carries over to generalised clause-sets, however whether again 
restriction to unsatisfiable clause-sets is possible (that is, deciding whether an 
unsatisfiable clause-set has a unique minimally unsatisfiable core) is not clear. 

Finally we can also generalise the observation of Bailey and Stuckey, independently 
also made in (Theorem 2), and exploited in where we use the same (sim- 
plified) proof as in Q (Section 2): For a hypergraph G denote by Tr(G) the 
hypergraph with the same vertex set V{G)^ while the hyperedges are the minimal 
transversals of G (minimal subsets of V{G) intersecting every hyperedge), and de- 
note by C(G) the hypergraph with vertex set V{G) and hyperedges V{G) \ H for 
H e E{G). 
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Lemma 2.4.13 For every (generalised) clause-set F we have 
min(EQ(F)) = Tr(C(max(NEQ(F)))). 

Proof The assertion is equivalent to C(Tr(min(EQ(i^)))) = niax(NEQ(F)), which 
just states that the maximal non-equivalent sub-clause-sets of F are exactly the 
maximal independent vertex sets of min(EQ(F)), i.e., those maximal sets of clauses 
not containing an irredundant core. I 



2.5 Minimally unsatisfiable generalised clause-sets 



This chapter is about the basic facts regarding (generalised) unsatisfiable irredun- 
dant clauses-sets, that is, minimally unsatisfiable (generalised) clause-sets. See |Q 
for an overview on minimal unsatisfiability (and extensions) in the boolean case. 
The basic parameter structuring our considerations is the deficiency, and especially 
the lowest possible deficiency is considered. 



In Subsection 2.5.1 "saturated minimally unsatisfiable clause-sets" are discussed 
(no literal occurrence can be added without destroying minimal unsatisfiability); 
this is a concrete example where generalised clause-sets behave essentially more 
complicated than boolean clause-sets. In Subsection 2.5.2 we characterise mini- 



mally unsatisfiable generalised clause-sets of deficiency one as well as the special 
cases of saturated and marginal clause-sets. Finally in Subsection ^.5.3 we collect 
some observations which might serve for further progress in the characterisation of 
minimally unsatisfiable clause-sets. 



2.5.1 Saturated minimally unsatisfiable clause-sets 

A clause-set F G CCS is called saturated minimally unsatisfiable if F is un- 
satisfiable, but for any clause C € F replacing C in F by C U {x} for any literal x 
with var(a::) ^ var(C) and |£'var(£!;)| > 2 yields a satisfiable clause-set.0 Saturated 
minimally unsatisfiable clause-sets are minimally unsatisfiable (consider x such that 
var(a;) ^ var(_F)), and actually a clause-set F is saturated minimally unsatisfiable 
iff it is minimally unsatisfiable and addition of a literal x with var(a;) £ var(_F') to 
any clause C with var(a;) ^ var(C) yields a satisfiable clause-sets. The se t of all 
saturated minimally unsatisfiable clause-sets is called SM-IA. By Lemma 2.4. 10| , 
part ^ we see that unsatisfiable hitting clause-sets are in SM.U. 

Lemma 2.5.1 Every minimally unsatisfiable clause-set F G AAU can be satu- 
rated, that is, there exists F* G SA414 with var(i^*) — var(i^) and a bijection 
TT : F ^ F* such that for all C e F we have C C 7r(C). 

Proof The observation needed here is that if for a minimally unsatisfiable clause- 
set F we replace some clause C G F by a clause C" D C, obtaining F' := (F\{C})U 
{C}, then F' is minimally unsatisfiable if F' is unsatisfiable (the only possibly 
redundant clause in F' is C", and if C" is redundant in F', then F' is satisfiable, 
since F'\{C'} = F\{C} G SAT). So we can add literals x with var(a;) G var(F) to 
clauses such that we maintain (minimally) unsatisfiability, and finally we will end 
up with a saturated F*. I 

For boolean clause-sets the characterisation of SM.U from Lemma C.I in |Q 
is fundamental: A minimally unsatisfiable boolean clause-set F is saturated if and 



^'instead of "saturated" in "strong" is used, and in "maximal"; we follow 
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only if for every variable v G var(F) and each e G Di, ~ {0, 1} it is {v —i' e) * F 
minimally unsatisfiable. Together with saturation this characterisation provides a 
powerful method for proving properties of minimally unsatisfiable clause-sets via 
induction on the number of variables. For generalised clause-sets saturatedness is 
weaker, and the above condition is only sufficient for being minimally unsatisfiable, 
but is no longer necessary. The following lemma develops these fundamental facts, 
using the following notion: We say that addition of literal x renders clause-set F 
satisfiable iff for all clauses C G F with var(a;) ^ var(C) the clause-set {F \ {C}) U 
{C U {x}} is satisfiable (thus a clause-set F is saturated minimally unsatisfiable iff 
F is unsatisfiable and addition of any literal renders F satisfiable). 

Lemma 2.5.2 Consider a generalised clause-set F e JVIU and a literal (w,£) G 
CIT. 



1. If (w — !■ e) * F e MU, then for all e' G Dy\ {e} addition of literal {v,e') 
renders F satisfiable. 

2. If V is boolean, and for e' G Dy \ {s} addition of literal {v, e') renders F 
satisfiable, then we have (v e) * F E A4U. 

Proof For Part ^ assume that there is C G F, i; ^ var(C) and e' G Dy \ {e} such 
that F' := {F \ {C}) U {C U {{v, e')}} is unsatisfiable. Then {v ^ e) * F' e US AT 
with {v — > e) * F' = {{v e) * F) \ {C}, and thus C would be redundant in 

{v ^ e) * F. 

For Part | assume that (w — > e) * _F is not minimally unsatisfiable; by Corollary 



2.4. 3| thus there is a clause C e F, v var(C) such that F \ {C} \= C U {(w,e)}. 
It follows that for F' := (F \ {C}) U {C U {(«,£')}} we have F' \= C (using one 
resolution step), and thus F' would be unsatisfiable. I 

Corollary 2.5.3 If for the generalised clause-set F G CCS for every partial assign- 
ment (p G HASS with n{ifi) < 1 we have ip * F G MU, then F G SM.U. If F is 
boolean, then also the reverse direction holds, that is, F G SM.IA if and only if for 
every partial assignment ip G VASS with n{ip) <1 we have (f * F € AiU. 

An exam ple, s howing that the implication "F G SA41I ^ {v e) * F G M.W 
in Corollary ^2.5.5 does not hold for generalised clause-sets, is as follows: Consider 



variables a, h with Da = Di, = {0, 1, 2}, and let F be the following clause-set with 
4 binary clauses and 2 unary clauses: 

F := { {a ^ 0,M 0}, {a ^ 1,M 0}, {a 0,M 1}, {« 7^ 1,M 1}, 

{a^2}, {M2}}. 

We have F G SM.U (after unit-clause elimination we obtain a boolean clause-set 
with all possible (full) clauses), while {a 2) * F = {_L, {h ^ 2}} ^ MU (as well 
as (6 — ?• 2) * F = {_L, {a ^ 2}} ^ MU). It might be worth investigating the class 
of (generalised) clause-sets F such that for all partial assignments ip with n{(p) < 1 
we have (p * F G M.U (a strict subset of SMU); see Subsection 2.5.3| for first 
observations. 

An important application of the process of saturation for boolean clause-sets is 
given by Lemma C.2 in [Q, proving that for every F G M.U, F ^ {L} there is a 
variable v G var(F) such that for both e G Dy = {0, 1} wc have #(i,.e)(F) < d{F). 
The proof is based on the characterisation of saturated minimally unsatisfiable 
boolean clause-sets in Corollary 2.5.3| and uses 5{F) > 1 for F' G MU, where F' is 
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obtained from F by applying suitable partial assignments ip with n{ip) — 1. There 
are various possibilities to obtain a generalisation for generalised clause-sets (the 
problem is that saturation is not that powerful anymore): 



• in Lemma 2.5.4 we obtain the generalisation to generalised clause-sets in the 
special case of deficiency one, 



while in Subsection 2.5.3 we consider the class of minimally unsatisfiable 



clause-sets stable under application of partial assignments with at most one 
variable, for which then the general bound can be shown (in Corollary 2.5.9] ). 



The existence of a variable v in the boolean case with #(i,.o)(^): i^{v,i){F) ^ ^{P) 
yields that the minimal variable-degree of F is at most 2S{F). Even for lean 
(boolean) clause-sets this can be strengthened considerably, as shown in ]6q|, wh ile 
the proper generalisation to generalised clause-sets is open; see Corollary |2.5.10 for 
a first result in this direction. 



2.5.2 Characterisation of the basic case of deficiency one 

Generalising the tree construction from (exploiting a formula class introduced 
by Stephen Cook and communicated to me by Alasdair Urquhart), let a deficiency- 
1 tree representation (in the remainder of this section just called "tree represen- 
tation") be a 4-tuple {T,r,v,e), where 

• (T, r) is a finite tree with root r (inner nodes (that is, nodes which are not 
leaves) can have an arbitrary number of children). 

• V labels each inner node w of (T, r) with a unique variable v{w). 

• e labels each edge e leading from a node w to a node w' (edges are directed 
from the root towards the leaves) with a value e{e) G such that the 
labelling of the edges going out from w yields a bijection to D^^^y 

If an order on the value set I?«(u,) is given, then also the outgoing edges are ordered 
by the same order; in the special case of boolean variables thus we can speak of 
"left" and "right" branches, corresponding to the positive and negative literal. An 
example R is given as follows. 




This tree representation R uses six variables a, . . . ^ f with Da = Dc = {0, 1, 2}, 
Df, — Df — {0, 1} (thus b, f are boolean variables), and Dd ~ ~ {0}. 

Given a tree representation (T, r, v, e), to every node w of (T, r) we associate a 
clause C„ by considering the path wg, Ci, Wi, . . . , e^,, Wm from the root to w in T 
(thus Wo — r, Wm — w, and the are the connecting edges from Wi-i to Wi, while m 
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is the length of the path), and setting '■— {{'vi'Wi)^ e(ei+i)) : i G {0, . . . , to — 1}}. 
The clause-set F{T, r, v, e) is defined as the set of all clauses for leaves w of 
(T, r). For the above example R we get 

F{R) = { {a ^ 0, & ^ 0}, {a ^ 0, 6 7^ 1, e ^ 0}, 

{a/l,c^O}, {a7^1,c^l}, {a^lc^2}, 

{a7^2,d^0,/^0}, {a^2,d^0,/7^1}}. 

We list some basic properties of the clause-sets F{T, r, v, e): 

1. The rooted tree (T, r) yields a resolution tree for i^(r, r, v, e) by labelling the 
nodes w with clauses Cw and considering the variables v{w) for inner nodes w 
as resolution variables; since = -L we see that F{T,r,v,e) is unsatisfiable. 

2. F{T, r, V, e) is a 1-regular hitting clause-set (for two different clauses C^i , C-w^ 
the unique clashing variable is v(wq) for the root wq of the smallest sub- 
tree of (r, r) containing wi and W2). It follows that F(T,r,v,e) is saturated 
minimally unsatisfiable. 

3. S{F{T, r, V, e)) — 1, since c{F{T, r, v, e)) is the number of leaves of (T, r), while 
wn(_F'(r, r, w, e)) is the number of edges of T minus the number of inner nodes 
of {T,r), and thus S{F{T,r,v,e)) is the difference of the number of vertices 
and the number of edges of T, which is 1 for every tree. 

4. If n{F{T, r, v, e)) > (that is, if (T, r) is not trivial), then we have: 

(a) There is exactly one variable occurring in every clause of F(T, r, v, e) 
(namely v{r)). 

(b) Every clause C £ F{T,r,v, e) contains a literal x G C with i^x(F) = 1 
(namely with var(a;) = v{wo), where C — and wq is the parent node 
of w). 

(c) There exists a variable v G va,i{F{T,r,v,e)) such that for all values 
e G Dy we have (F(T, r, u, e)) = 1 (choose v — v{w) for an inner 
node w of (T, r) such that all children of w are leaves) . 

We can read off many more properties of F{T,r,v,e) directly from the tree repre- 
sentation, for example the minimal resp. maximal clause-length is the minimal resp. 
maximal depth of a leaf, but we need here only the above listed properties. Using 
TZHIT for the set of regular hitting clause-sets and hd for the hitting degree, as 
introduced before, we have F{T, r, v, e) G 7^HI7^d*^M=l■ 

We say that a clause-set F' G CCS is obtained from F{T, r, v, e) by literal elim- 
ination if F' is obtained from F{T,r,v,e) by eliminating some literal occurrences 
(at least one) without ever creating a pure variable. Replacing "F{T, r, v, e)" by F' , 
are still valid, while Properties |2[ ^ are lost: F' is definitely 
not a hitting clause-set anymore, and there does not need to exist a variable occur- 
ring in every clause. It is furthermore F' definitely not saturated anymore (by the 
definition of F')^ however F' is still minimally unsatisfiable (since removal of any 
clause either creates a pure variable or removes the only clause). 

In Lemma C.5 from [|4j it is shown that the boolean elements of SM.Us=i are 
exactly the clause-sets F(r, r, u, e) using only boolean variables, while the elements 
of MUs=i \ SMUs^i are exactly the clause-sets obtained from such F{T,r,v,e) 
by literal elimination. To generalise this characterisation, the following lemma is 
central (compare Property |4c| from above). 



Properties nl 0, 
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Lemma 2.5.4 For every (generalised) clause-set F G AiUs=i with n{F) > there 
exists a variable v e var(_F) such that for all e £ Dy we have =f^(v.s)iF) — 1. 

Proof Consider F G M-Us=i- We investigate the structure of Q{F) (recall Section 



2.3). As we remarked in Subsection 2.3.2, we have S{Q{F)) = 1, and thus by 



Lemma 2.3.3 we have Q{F) G AiUs^i- Since 8(F) is a boolean clause-set, we can 
conclude that &{F) is obtained by literal elimination from some tree representation 
{T,r,v,e) as defined above (using only boolean variables). Q{F) always has the 
following special properties: 

(i) Q{F) is a PN-clause-set, that is, every clause is either positive or negative. 

(ii) For every negative clause N G 8(F) we have Vx G iV : #2;(F) ~ 1 (recall that 
the negative clauses are the ALO-clauses introduced by the translation 8). 

Call a boolean F G MUs=i special if these two conditions are fulfilled. (These 
"special" boolean clause-sets constitute exactly the image Q{A4Us=i) of the trans- 
lation, but we do not need this simple fact here.) Consider a tree representation 
(T, r, V, e) of a special F; obviously also all clause-sets given by the subtrees of (T, r) 
are special again. Now we proof by induction on the height of the tree representa- 
tion of special formulas F with n(F) > that there always exists a negative clause 
N £ F such that Vx G iV : #^(F) = 1, using the standard complement notation 
for boolean literals here; this proves the lemma by definition of the translation 8. 



If the height of (F, r) is 1, then F is {{v{r)}, {v{r)}}, and the assertion is true. 
So assume the height of (T, r) is greater than 1, and consider the left subtree Tq and 
the right subtree Ti of T with associated special Fo,Fi G AiUs=i- If Tq is not the 
trivial tree (has more than one node), then by the induction hypothesis there exists 
a negative clause (non-empty) A^o G Fo with \f x G A'o : #^(Fo) = 1. Now we must 
have A^o G F, since otherwise A^o U {v} G F, where this clause is neither positive 
nor negative; using N := Nq proves the assertion (since none of the variables in 
A^o occurs in Ti in this case). So the remaining case is that Tq is the trivial tree. 
Again by the induction hypothesis there is a negative clause (non-empty) A^i G Fi 
with Va; G A^i : #x{Fi) = 1. Either we have AT := A^i G F or A^ := A^i U {v} G F, 
proving the assertion (in the second case due to the triviality of To). I 



Now we are able to generalise Lemma C.5 in (Part (y) of Theorem 2.5.5 has 
been shown for boolean clause-sets in p^): 

Theorem 2.5.5 The class AiUs=i of minimally unsatisfiable (generalised) clause- 
sets of deficiency 1 has the following two characterisations: 

(i) For F G CCS we have F G MUs=i if and only if F can he reduced to the 
clause-set {_L} by applying non- degenerated singular DP-reduction (as long as 
possible, in any order). 

(ii) A4bls=i is the class of all clause-sets F{T,r,v,e) together with all clause-sets 
F' derived by literal elimination from such clause-sets. 



Proof Par t (p|) f ollows from Lemma [2.5.4 together with Lemma 1.6.1 (Part I) 



and Lemma 1.11.6, Part [2(b)i (also Part I). For Part ^ it remains to show that 



every F G MUs=i can be obtained from some F(T,r,v,e) by a (possibly empty) 
sequence of literal eliminations. We show this by induction on n{F). If n{F) — 0, 
then F = {-L}, and we can take the trivial rooted tree. So assume n{F) > 0. By 
Lemma p. 5. 4 there exists a variable v G var(F) such that for all e G Dy we have 



#(!;,£) {F) = 1; let Ce G F be the unique clause with {v, e) G C^. Thus v is a singular 
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DP- variable w.r.t. F. Let G := DP„(i^); we have G^{F\ {CejeeoJ U {R}, where 
R = [j^fzD (C \ {(«,£)}). As already argued for Part ([]) we have G £ MlAs=i, 
and thus we can apply the induction hypothesis to G; the assertion follows now 
immediately by extending the tree representation of G at the leaf labelled by R by 
adding new leaves G^ for e e ZJ^,. I 



Theorem 2.5.5 yields also two further poly-time decision procedures for the class 
AiUs=i (while two general poly-time decision procedures for the classes AAUs=k for 
fc G N are given by Corollary 1.8.7 (Part I) and Theorem 2.3.5). To conclude, we 
characterise the saturated and the marginal elements of M.Us=i. 



Corollary 2.5.6 The class SAiUs^i of saturated minimally unsatisfiable (gener- 
alised) clause-sets of deficiency 1 is exactly the class of all clause-sets F{T,r,v,£). 
It follows that the following conditions are equivalent for a clause-set F S CCS: 



1. F = F{T,r,v,e) for some deficiency-1 tree representation (T,r,v,e). 

2. F is an unsatisfiable 1-regular hitting clause-set of deficiency 1 (i.e., F G 

3. F is an unsatisfiable regular hitting clause-set of deficiency 1 (i.e., F e 

4. F is an unsatisfiable hitting clause-set of deficiency 1 (i.e., F G 'Hnj^\~^). 

5. F is a saturated minimally unsatisfiable clause-set of deficiency 1 (i.e., F 6 
SMUs=i). 



If a minimally unsatisfiable clause-set is hitting, then it is saturated; Corollary 



2.5.6 prov es the reverse for deficiency 1 (which does not hold for higher deficiencies). 
Corollary 2.5.t show s that SMUs=i covers some unsatisfiable regular hitting clause- 
sets — in Corollary 2.6.7 we will see that it actually covers all of them. 

While saturated minimally unsatisfiable clause-sets do not allow addition of any 
literal occurrence to any clause without destroying the property of being minimally 
unsatisfiable, on the other end of the spectrum we have marginal minimally un- 
satisfiable clause-sets, which are minimally unsatisfiable clause-sets such that 
removing any literal occurrence destroys the property of being minimally unsatisfi- 
able. 



Corollary 2.5.7 The class A4A4lAs=i of marginal minimally unsatisfiable (gener- 
alised) clause-sets of deficiency 1 is exactly the class of all F 6 MUs=i for which 
no further literal eliminations are possible, which is equivalent to the property that 
F is totally singular (recall Subsection 1.9.1 from Part I), that is, for every variable 
V G var(i^) and every e G Dy we have #(i,.e)(i^) = 1- 

Thus AiAilAs=i is the set of totally singular elements of A4Us=i. Actually, 
MMUs=i is the set of totally singular elements of all of M.IA. 



Proof If for a minimally unsatisfiable clause-set F every literal in it occurs exactly 
once, then obviously it is marginal; by Theorem 2.5.5 we obtain the reverse direction 
(since if for some value more than one occurrence of a variable is left, then literal 
elimination is still applicable). 

If on the other hand a totally singular F is minimally unsatisfiable, then it 
is minimally matching unsatisfiable, and thus by Corollary |1.9.11| (Part I) it has 
deficiency I. I 
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We remark that in it is shown that in the boolean case the class of conflict 
graphs of F G J^Us=i is exactly the class of all connected graphs, while the conflict 
graphs of saturated (boolean) F G MUs^i are exactly all complete graphs, and the 
conflict graphs of marginal (boolean) F G A4Us=i are exactly all trees; furthermore 
a boolean element of AiUs=i is saturated resp. marginal iff the conflict graph is 
complete respectively a tree. And a totally singular multi-clause-set is minimally 
unsatisfiable iff its conflict multigraph is a tree. 

Finally we remark that in general the decision problem whether a (generalised) 
clause-set is saturated resp. marginal is D^-complete as shown in (there for 
boolean clause-sets, which obviously immediately generalises). 



2.5.3 Stability parameter and minimal variable degree 

Let us conclude the chapter by some considerations which summarise certain ob- 
servations we made, and which could serve as basis for further investigations. 

For a multi-clause-set F let sir(i^), the (substitution) stability parameter 
regarding irredundancy, be the supremum in Z>_i U {+00} of n £ No such that 
for all (f e VASS with n{(p) < n the multi-clause-set (p* F is irredundant. We have 
the following basic properties. 



1. sir(F) = — 1 iff F is redundant, sir(F) > iff F is irredundant. 

2. sir(F) = +00 iff sir(F) > n{F) iff F is a hitting clause-set (see Corollary 

EH). 

3. Assume that F is unsatisfiable. Then sir(F) > 1 => F e SMU by Corollary 
2.5.3, and for boolean F we have equivalence. 

For n(F) > let the min-max var-degree resp. the minimal var-degree be 

defined by 

mmvd(F) := min max #/„ ^1 (F) e N 
mvd(F) := min #^,(F) e N. 

i)Gvar(F) 

An upper bound mmvd(F) < k says that there exists a variable v such that for all 
values £ of w we have at most k occurrences of v with that "polarity" . So Lemma 
2.5.4 now can be reformulated as the statement \/ F E AiUs^i : F ^ {_L} ^ 
mmvd(F) — 1. 



Lemma 2.5.8 Consider a generalised multi- clause- set F and a variable v S var(F) 
which is not pure for F (i.e., W e € Dy : #(t,.£) (F) > 1), such that ^^{F) = mvd(F). 
Then we have for e £ Dy the following. 



1. 5{{v e) * F) ^ 5{F) - S(^,_j)(F) + \Dy\-l. (Compare Lemma \l.ll.4 , Part 
^ from Part I.) 

2. Assume sir(F) > 1 and that F is unsatisfiable. 

(a) S(y^,){F)<5{F) + \Dy\-2. 

(b) If V is non-trivial (i.e., \Dy\ > 2), then #(„,e)(F) < S{F). 
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Proof Part |i| follows by the observation that var((w s) * F) = var(F) \ {v} (if 
another variable w would vanish, then every occurrence of w would be in a clause 
C with some {v,e') € C for e' e £)„ \ {e}, and so #w{F) < S(„_£)(F) = #v{F) - 
#{v,e)iF) < #v{F) = mvd(i^) < #^o{F)). For Part | we have sl{v ^ e) * F) > 1, 
and thus Part ^ follows. For Part ^ consider s' G Dy \ {s}. By Part ^ we have 

S(„.e')(^) <'5(F) + |i?, I -2, where S(„.,,)(F) (F) + Ee"eDA{e > 

#(„,,)(F) + |I?,|-2. I 

Corollary 2.5.9 For an unsatisfiable generalised clause-set F with sir(i^) > 1, 
n{F) > 0, we have 

mmvd(i^) < S{F). 

Proof Eliminating all trivial variables from F we obtain the clause-set F' with 
S{F') = ^iF) and c{F') — c{F); now the assertion follows by Part ^ of Lemma 

Since every minimally unsatisfiable (generalised) clause-set can be saturated 



(see Lemma 2.5.1), and every boolean saturated minimally unsatisfiable clause- 
set F fulfils sir(i^) > 1, we get for arbitrary boolean F G MU the upper bound 
mmvd(F) < S{F) (as shown in pj|). For generalised minimally unsatisfiable clause- 



sets we showed this upper bound in Lemma 2.5.4 for the simplest case S{F) = 1, 
while the general case is open. 

Corollary 2.5.10 Consider an unsatisfiable generalised clause-set F with sir(_F') > 
1 and n{F) > 0, and let D d N be the maximal domain-size. Then we have 

uwd{F) < D-S{F). 

For boolean clause-sets, a stronger and also more general bound is shown in pqj. 



2.6 The conflict structure of generalised clause- 
sets 



In this final section we conclude the basic combinatorial theory of generalised clause- 
sets by regarding their conflict structure, continuing ^ |5J]. A fundamental 
tool again is translation to boolean clause-sets, but introducing here a new trans- 
lation in Subsection 2.6.1, which, different from the direct translation, leaves the 



conflict structure invariant. Then in Subsection 2.6.2 the basic facts from linear 



algebra regarding the "hermitian rank/defect" are summarised and generalised to 
generalised clause-sets, discussing the close relation to the theory of multiclique 
partitions of multigraphs and the applications to regular hitting clause-sets. 

Recall from Subsection 1.3.4 (Part I) that an r-regular hitting clause-set F is 



a (generalised) clause-set F such that any two different clauses clash in exactly 
r £ No clauses. Using the conflict multigraph cmg(F) of F, the multigraph with 
the clauses of F as vertices and as many edges between vertices as the clauses have 
conflicts, we see that F is r-regular hitting iff cmg(i^) is isomorphic to r • K^^p-^, 
where Km denotes the complete graph with m vertices, while the factor r makes r 
parallel edges out of every edge. The conflict matrix CM(i^) of F is the adjacency 
matrix of cmg(i<"), that is, the square matrix of order c{F) where the entry at 
position (i, j) denotes the number of clashes between clauses i and j. Every conflict 
matrix is symmetric, non- negative with a zero diagonal and with integral entries; 
F is r-regular hitting iff all entries of CM(F) except of diagonal-entries are equal 
to r. 
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2.6.1 A new boolean translation 



We introduce a new translation of generalised multi-clause-sets F into boolean 
clause-sets F t-^ 6n(F), called the n este d translation, with the advantage over 
the direct translation (recall Section \i.3\) that the conflict structure is preserved, 
that is, we have CM{F) — CM(6n(i^)); the price we have to pay here is the intro- 
duction of a "nesting structure" , which breaks the symmetry between the different 
values of a variable. 

Consider a variable v with domain Dy = {ei, . . . , Ek} (where k — |-Dt,|). The 
conflict multigraph of the clause-set {{v Si} : i ^ {f,...,fc}} £ SM.Us=i is 
isomorphic to the complete graph Kk with k vertices. The nested translation 
is based on the following exact realisation of this conflict graph by a boolean 
clause-set^, namely by the minimally unsatisfiable boolean Horn clause-set Hy — 
{£",,,1, . . . , Etj^u} G SM^Us=i with fc — 1 variables wi, . . . , Vk^i, where 



, Vi^i.Vi] \ii <k 
, wFT} if i = fc 



For example for fc = 4 we have the 3 boolean variables variables vi,V2, and the 
clauses E^^i = {vi}, £'„,2 = {vi,V2}, E^^s = {wTjWjWs} and Ey^^ = 

Now the nested translation F H> Qn{F) replaces iteratively one variable v with 
domain size ky = \Dy \ > 2 by boolean variables ui, . . . , Wfe„_i, without changing the 
number of clauses or their conflict structure, but enlarging clauses. The replacement 
of a single variable v, a transition F i-^ Fy repeated for all variables with at least 
three values, happens as follows: 



For i G {1, . . . , fc} let Fi be the sub-multi-clause-set of F collecting all occur- 
rences of literal (w,£i) (thus c{Fi) — #(v.Ei){P))- 



• Consider new boolean variables vi, . . . ,Vk-i and the Horn clause-set Hy = 
{Eys, ■ ■ ■ , Ey,k} as above; the meaning of Vi is related to "w 7^ £;". 

• Now for i G {1, . . . , fc} replace each clause C £ Fihy clause {C\{{v,ei)})LiEy^i 
(replacing the literal (f,£i) by the literals from Ey^i). 

• For the obtained multi-clause-set Fy by definition we have 

c{Fy) = c{F) 
GM{Fy) = CM{F), 

and if F does not contain pure variables, then we have d{Fy) — 5{F), while 
in general we have S{Fy) > S{F). 

In order to keep Qn{F) small (that is, to keep the number of literal occurrences 
small), one can order the domains Dy according to descending number of occur- 
rences (so that the bigger clause-enlargements occur less often), however in our 
general context such considerations are not of importance. We consider now a sim- 
ple example, using two variables a,b with Da = {0,1,2,3} and Db = {0,3,2,1} 
(using these orderings for the translation), while 

F {{(a,0)}, {(6,0)}, {(a,0),(6,0)}, 

{(a,l),(fe,l)}, {(a, 2), (6, 2)}, {(a, 3), (6, 3)} }. 



other minimally unsatiafiablc realisations lead to other translations, which could be used here 



as well; see Subsection 2.7.3 
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We have c(F) = 6, n{F) = 2, wn(F) = 2 • (4 - 1) = 6 and 5{F) =6-6^0. The 
new variables are vq, vi, V2 for a, and wq, wi, W2 for fe, and the translated clause-set 
is 



0n(-F) = { [va], {wq}, {vq,Wo}, 

We have c(e„(F)) = 6, n(e„(F)) = 6 and (5(e„(F)) = 0. Note that 5*{F) = 
(since F is matching satisfiable), but 5*(9„(i^)) = 1, while the matching-lean kernel 
of e„(F) is {{wo},{wo},{t;o,wo}}. 

The idea for this translation is taken from the reduction of multiclique parti- 
tions to biclique partitions as observed in [ psf — another example, where a simple 
graph-theoretical observation extends to a fuller statement of (generalised) logic. 
Regarding the properties of the nested translation, we focus on the most basic prop- 
erties here. We already noticed that we always have c(8n(f )) = c(F), while for the 
number of formal variables (which might not occur) we have n(0n(f )) — wn(i^). 
By definition we get: 

Lemma 2.6.1 For multi- clause- sets Fi,F2 we have 

e„(i^i + ^^2) = e„(Fi) + e„(F2), 

given the same choice of translation variables and the same nesting order. Thus for 
a multi- clause- set F and F' < F we have 0n(-F") < Qn{F)- 

Lemma 2.6.2 For a generalised multi-clause-set F we have: 

1. CM(en(i^)) = CM(F). 

2. 5{Qn{F)) > (5(F); if F has no pure variables, then ^(e„(F)) = 6{F). 

3. Qn{F) is satisfiable if and only if F is satisfiable. 

4. Qn{F) is minimally unsatisfiable if and only if F is minimally unsatisfiable. 

Proof Properties |^, ^ follow from the remarks above. For Property ^ first assume 
that F is satisfiable with satisfying total assignment (p. Consider v e var(F) with 
ip{v) = Si and the boolean variables wi, . . . ,Vk-i as used above in the definition 
of &n{F)- Now for J 6 {1, . . . , fc — 1} assign Vj 1 for j ^ i, while for j ~ i let 
Vj — ?> 0. The total assignment ip' obtained in this way satisfies 9n(F). For the other 
direction consider a satisfying total assignment ip for 9n(F), and for v G var(F) let 
ip'iv) := Si for the smallest i e {1, . . . , fc} such that i < k ^ 'fiivi) — holds; again 
by definition we see that ip' satisfies F. Finally Property ^ follows with Property ^ 



and Lemma 2.6.1 



2.6.2 The hermitian defect, and regular hitting clause-sets 

The hermitian rank h{M) of a symmetric real matrix of order m (which has only real 
eigenvalues) is the maximum of the number of positive and the number of negative 
eigenvalues of M, while the hermitian defect of M is <5h(M) := m — h{F), which 
is the Witt index of the associated bilinear form; for more on this and its relation 
to (boolean) satisfiability problems see |4|, |2|. We use h{F) /i(CM(F)) and 
(5h(F) :— ^h(CM(F)) for (generalised) clause-sets F. In Q it was shown that 
boolean clause-sets F fulfil d{F) < 6h{F), which is just a translation of the well- 
known basic Graham-PoUak theorem about biclique partitions of multigraphs. Since 
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for regular non-empty hitting clause-sets F we have 5\i{F) = 1, it follows 5{F) < 1 
for (boolean) regular hitting clause-sets. 

By Lemma ^.6.2 , Part |l| we have that 



/i(e„(F)) = h{F) 

for multi-clause-sets F. Thus by Theorem 13 in p3 we get 



(2.1) 



Lemma 2.6.3 SAT decision for generalised clause-sets F with h{F) < 1 can be 
done in polynomial time (and for example by self-reduction also satisfying assign- 
ments can be found). 

Regarding the main subject of this section, from ( p^ ) we obtain S{F) < S{Qn{F)) < 
Si,{en{F)) = 5h(F): 

Theorem 2.6.4 For a generalised multi-clause-set F we have S{F) < Sii{F). 



The proof of Theorem 2.6.4 works by using the nested translation, and it is not 
clear how to generalise the notion of hermitian rank to obtain a direct proof. 

Corollary 2.6.5 For a generalised clause-set F which is regular hitting we have 
(5(F) < 1. 



In the terminology of graph partitions. Corollary 2.6.5 generalises "Witsen- 
hausen's Theorem" , the special case of the Graham-Pollak Theorem asserting that 
every biclique partition of a complete graph Km needs at least m — 1 bicliques: 

• A multiclique in a graph is a subgraph (not necessarily induced) which is a 
complete fc-partite graph for some k. 

• A multiclique partition of a multigraph consists of edge-disjoint multicliques 
covering every edge. 

• Generalised clause-sets are multiclique partitions of their conflict multigraphs, 
where the multicliques correspond to variables, with the variable-values and 
their occurrences corresponding to the parts of the multiclique. 



• Thus in Corollary 2.6.5 we allow to partition the edge set of r ■ Km into 
complete multipartite graphs, where every complete fc-partite component (k 
not fixed) contributes the "cost" fc — 1, and Corollary 2.6.5 now says that the 
total cost must be at least m — 1. 

• Allowing only constant fc = 2 is the Theorem of Witsenhausen, allowing only 
constant fc = m is trivial, and for arbitrary constant fc compare Example 1.1 

in Hi. 

Let us consider an example here, which also shows that multiple edges can help to 
achieve a higher deficiency (i.e. using less variables). Let the multigraph G be 



G 



1 




6 



where the edge between vertices 2 and 5 has multiplicity 2. An optimal biclique par- 
tition of the edge sets is given by the two bicliques ({1, 2}, {4, 5}) and ({2, 3}, {5, 6}); 
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a corresponding ( "exact" , boolean) clause-set F with deficiency 6 — 2 = 4 has clauses 
Ci,...,C6 and variables vi,V2, where variable vi occurs positively in Ci,C2 and 
negatively in C4, C5, while variable V2 occurs positively in C'2, C3 and negatively in 
C5, Cg. The underlying graph has adjacency matrix 

/O 1 1 0\ 

1 1 1 

1 1 

1 1 
1110 

yo 1 1 0/ 

with hermitian rank 3 (it has eigenvalues ±1 each with multiplicity 3), and thus by 



Theorem 2.6.4 every multi-clause-set with this conflict matrix has a deficiency of at 
most 6 — 3 = 3, i.e., 3 variables are needed (which obviously also suffices here (so 
this graph is "eigensharp")). 

Unsatisfiable hitting clause-sets are minimally unsatisfiable, and thus have defi- 
ciency at least 1; it follows: 

Corollary 2.6.6 Unsatisfiable regular hitting clause-sets have deficiency exactly 1, 
i.e., 7^-HXr'''*=° = 7^HI7lf^ °. 

Generalising Corollary 34 in we can now precisely characterise all unsat- 
isfiable regular hitting (generalised) clause-sets. An unsatisfiable regular hitting 
clause-set with at least two clauses can not be /c-regular ioi k 1, since otherwise 
it would be satisfiable due to the completeness of resolution. Thus by Corollary 



2.5.6 we get: 



Corollary 2.6.7 TL'HIT^'^'^ " = SJ\4Us=i, that is, unsatisfiable regular hitting 
clause-sets are exactly the satur ated m inimally unsatisfiable clause-sets of deficiency 



1 as characterised in Corollary 2.5.6 



An interesting aspect of unsatisfiable boolean regular hitting clause-sets has been 
revealed in , where the basic setting translated to our language and generalised 
to generalised clause-sets reads as follows: 

1. An unsatisfiable hitting clause-set F corresponds to a partitioning of the space 
of all total assignments n-uevar(F) ^^^'^ disjoint "cubes" , where a cube is 
a subset of ni)evar(F) where some variables have a fixed value while the 
values of all other variables are arbitrary. 

2. 1-regularity of F means that any two different cubes in the corresponding 
cube-partition have distance (exactly) 1, where the metric on ri)jevai(F) 

the Hamming distance (counting the number of different positions) , and the 
distance between two subsets is the minimal distance of their members. 

3. Thus 1-regular hitting clause-sets F correspond to "neighbourly cube parti- 
tions" of n^evarCF)^!-- 



Now gave first a "combinatorial" direct proof of Corollary p.6.7| for the boolean 
case (that is, of Corollary 34 in j49|), and then showed that by adding literals 
with new variables to the leaves of the tree representations of the elements of 
SAiUs=i{VA{o^ij) one obtains exactly those boolean clause-sets F which have ex- 
actly the maximal number of prime implicates possible (for a given number of 
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clauses), namely 2'^*^^^ — 1 (see Corollary 2.4.6). These considerations are gener- 
alised in since (generalised) clause-sets have a too restricted language here, in 
order to obtain the maximal number of prime implicates the full power of signed 
clause-sets is needed (that is, the appropriate notion of prime implicate here actually 
must allow signed literals). 



2.7 Conclusion and open problems 

The main purpose of this series of two articles was to set the stage for the study 
of generalised clause-sets as sets of "no-goods", where literals are given by one 
"forbidden value" : 

• We defined and summarised the basic properties of syntax, semantics, resolu- 
tion calculus and autarky systems. 

• Then we considered the generalisation of the notion of deficiency for these 
generalised clause-sets, and we studied the basic autarky system related to 
this notion, namely matching autarkies. 

• We showed fixed parameter tractability of satisfiability of generalised clause- 
sets in the maximal deficiency, while the computation of a maximal autarky 
(yielding the lean kernel) can be done in polynomial time for fixed maximal 
deficiency. 

• For autarky systems both the application of autarkies as reductions and the 
properties of autarky-freeness, i.e., lean clause-sets, are of interest. 

• Lean clause-sets are a generalisation of minimally unsatisfiable clause-sets, for 
which we considered the basic problem, when the property of being minimally 
unsatisfiable is preserved under application of partial assignments, and we 
characterised also minimally unsatisfiable clause-sets of minimal deficiency. 

• More generally, also irredundant clause-sets have been considered. 

• Besides using the generalised tools transferred from the boolean case, also the 
structure preserving properties of boolean translations are important, and we 
investigated basic cases. 

• The notion of deficiency introduced here for generalised clause-sets is funda- 
mental for many of these considerations, and we considered also the (close) 
relation to multiclique partitions of multigraphs and related areas. 



2.7.1 Minimally unsatisfiable clause-sets of low deficiency 



Having generalised the characterisation of mini mally unsatisfiable clause-sets of defi- 
ciency one from the boolean case in Subsection ^2.5.2 , the next question concerns the 
generalisation of the structure of boolean A4l4s=2 as studied in |Q. This generali- 
sation seems to be not straightforward, but we believe that minimally unsatisfiable 
generalised clause-sets of deficiency two are still quite close to the boolean case 
(while from deficiency three on generalised clause-sets behave more wildly). 

A key tool for the study of boolean minimally unsatisfiable clause-sets is the 
observation in |^ that for every boolean minimally unsatisfiable clause-set F with 
n{F) > there exists a variable v € var(F) such that for both e € {0, 1} we have 



i^{v,e){P) ^ see Lemma 2.5.8 for a discussion of this subject. As applied in 

the proof of Lemma 2.5.4, the direct boolean translation is a suitable tool here; 
additionally the nested translation from Subsection ^.6.1 could be useful. 
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2.7.2 Conflict combinatorics 



As touched upon in Lemma 2.4.12| , the SAT problem for bihitting clause-sets is 



essentially the same as the hypergraph transversal problem, and whether the latter 
problem can be decided in polynomial time is a long outstanding open question. Be- 
ing optimistic about the potential of (generalised) clause-sets to provide a unifying 
framework for (hard) graph and hypergraph problems, we propose: 

Conjecture 2.7.1 Satisfiability decision for multihitting (generalised) clause-sets 
can be done in polynomial time. 



Since the nested translation (recall Subsection 2.6.1) maintains the hermitian 



rank (as well as the hermitian defect), Conjecture 15 from is equivalent to 

Conjecture 2.7.2 Satisfiability decision for generalised clause-sets can be done in 
polynomial time for bounded hermitian rank. 

See also Section 5.2 in for more specialised conjectures regarding the structure 
of boolean clause-sets of hermitian rank 1 (which can be generalised to generalised 
clause-sets). 

One of the early problems in the area of addressing graphs, the "squashed cube 
conjecture" solved by Peter Winkler (see Chapter 9 in |8^), is generalised by the 
following conjecture (where a clause-set F is called exact, if its deficiency is maximal 
amongst all clause-sets with the same confiict matrix (or conflict multigraph)): 

Conjecture 2.7.3 Consider an exact clause-set F , whose conflict matrix is the 
distance matrix of some connected graph. Then F is matching lean. 

What Winkler originally proved is equivalent to the statement that 5{F) > 1 holds 
under these assumptions (for non-empty F). 

The Alon- Saks- Seymour- Conjecture ("Every graph G which can be written as 
a union of m edge-disjoint complete bipartite graphs has x(G) < m + 1.") can be 



reformulated in our context as follows, where for consistency with Theorem 2.6.4 
we introduce the "colouring deficiency" S)^{F) :— c{F) — x(cg(F)) of a clause-set F 
(where x(G) is the chromatic number of graph G, the minimal number of colours 
needed for a proper vertex colouring): 

Conjecture 2.7.4 For every clause-set F with at most one confiict between any 
two clauses we have 6{F) < 6^{F) + 1. 

Since regular hitting F have S^{F) = 0, this would yield a generalisation of Corollary 



2.6.5 (for 1-regular hitting clause-sets) in a different direction. The restriction for F 



in Conjecture 2.7.4, which just states that the conflict multigraph of F in fact is a 
graph, seems necessary, since colouring of multigraphs only considers the underlying 
graph, while allowing parallel edges can increase the deficiency. 

We remark that w.r.t. the computation of the biclique partition number bcp(G) 
of a multigraph G, which is the minimal n{F) for boolean clause-sets F with 
cmg(F) = G, there are basic open questions: 

• It is known that the decision "bcp(G) < k" is NP-complete, where k is part 
of the input. 
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• This has been shown in [^: for graphs G without 4-cycles bcp(G') is the 
vertex-cover-number (the special case of the transversal number for hyper- 
graphs, applied to graphs — the smallest number of vertices hitting every 
edge), since the only bicliques in such graphs are stars, while for such graphs 
the vertex-cover-number problem (or, equivalently, the independence-number 
problem) is NP-complete. 

• Now for fixed k we should have polynomial time decision of "bcp(G) < k T\ 
while to achieve fixed parameter tractability (in k) seems more challenging. 

Instead of computing the minimal n[F) for boolean F with cmg(_F) = G (then we 
have n{F) = bcp(G)) we can also compute (equivalently) the maximal 5{F) for such 



F, a point of view which seems to have some advantages, and where by Lemma 2.6.2 
we can also allow all generalised clause-sets here (since the weights for the variables 
counterbalance that variables with larger domains are more powerful). However 
considerations as in [|8| ask for the minimal n{F) for generalised clause-sets F with 
uniform domain size d (a further parameter) such that cmg(i^) = G, and then we 
need to consider only generalised clause-sets where all variables have domain size d. 



2.7.3 Translations to boolean clause-sets 



In Subsection 2.6.1 we have introduced the nested translation F H> 6n(^') from 



(multi-)clause-sets to boolean (multi-)clause-sets. The idea can be generalised as 



follows, covering then also the direct translation (see Section 2.3), in the "weak 
form" we have used (considered only ALO-clauses) as well as in the "strong" form 
(using also AMO-clauses). We call this scheme the generic translation. 

To every variable v G var(F) one needs to associate two boolean clause-sets 
T{v),T'{v) together with a bijection : Dy — > T{v), such that the following 
conditions are fulfilled: 

• all clause-sets T{v) U T'{v) are unsatisfiable; 

• all clauses of T{v) are necessary in T{v) UT'{v) (that is, removing any clause 
of T{v) renders T{v) U T'{v) satisfiable); 

• for different variables v, w we have var(T('i;) U T'{v)) n var(T(i(;) U T'{w)) — 0. 
Now the translated clause-set Qt,t' .-yiF) is obtained as follows from F: 

1. every clause C € F is replaced by the clause IJ^^ ^-jg^ Ivi^)] 

2. the clauses from the clause-set Ut)evar(F) ^'(^) ^''^ added. 
Lemma 2.7.5 Qt,T' .-yiF) is satisfiability-equivalent to F. 

Proof First consider a total satisfying assignment ip for F. For v S var(i^) choose 
a total satisfying assignment t/jy for {T{v) U T'{v)) \ {ivi^fiiv))}; now U«gvar(F) V'l- 
is a satisfying assignment for Q{F). Now consider a satisfying assignment -0 for 
Q{F). For each v G var(F) choose one clause Cy G F{v) falsified by tp, and let 
(p{v) := j~^{Cv); now (/? is a satisfying assignment for F. I 

The general scheme for handling a variable v with k values can be understood as 
choosing some minimally unsatisfiable boolean clause-set Tg with at least k clauses, 
choosing k clauses from Tq for T(v), and putting the remaining clauses into T'{v), 
where T'{v) can be augmented with further clauses such that the clauses of To 
remain necessary. Four main classes of examples are as follows: 
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To obtain the direct translation, let T{v) :— { {''"((^j £))} Igg^jj^,-) be a set 

of unit-clauses, where 7«(e) :— {T((w,e))}, and let T'{v) :— |{T((u,e)) : 
e € D{v)} } be a singleton clause-set (consisting of the ALO-clause for v). 
For the strong form just add the AMO-clauses to T'{v). So the underlying 
minimally unsatisfiable boolean clause-set (with k variables and fc + 1 clauses) 
is {{vi},...,{vk},{vi,---,v]:}}. 

For the nested translation, let T{v) := Hy and T'{v) := T, while 7^ is any 
bijection between Dy and Hy (using the notations from Subsection ^.6.1 ). To 



obtain a strong form, add all positive binary clauses to T'{v). The underly- 
ing minimally unsatisfiable (saturated) boolean (Horn) clause-set (with k — 1 
variables and k clauses) is { {vi}, . . . , {Wf, . . . , w/c_2, Wfc-i}, {iJT, . . . , W/t-i} }. 

• Using the same underlying minimally unsatisfiable clause-set as with the direct 
translation, however this time using all clauses for T{v), leaving T'{v) empty, 
and so using only fc — 1 variable, we obtain the reduced translation^ which can 
also be obtained from the nested translation by removing all negative literals 
from the first fc — 1 clauses. A strong form is obtained by adding all positive 
binary clauses to T'{v). 

• Finally, to obtain an example of a "logarithmic translation" , assume for sim- 
plicity that we have fc = 2^ for the domain size fc = \Dy\, and let T{v) be the 
full clause-set (all clauses contain all variables) with fc clauses over p variables, 
while T'{v) := T.0 

Please note that for the generic translation every variable can be treated differ- 
ently. A systematic study of the generic translation scheme, which admits now the 
possibility to adopt the translation to the problem at hand, should yield powerful 
tools for solving SAT for generalised clause-sets by boolean SAT solvers. For a first 
empirical study see |M. 



^'Thc simplest form of handling arbitrary k is to consider the smallest p with 2^ > k, to choose 
k full clauses (over p variables) for T(v), and to put the remaining full clauses into T' (v). 
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